Quantcast
Channel: The Insiders: Journeys in SQL - metadata
Viewing all articles
Browse latest Browse all 10

Index metadata

$
0
0

Ok, so it's been a while, but we're finally onto index metadata. We have three types of indexes in SQL Server, Relational, XML, Spatial. Okay, so that's really only three types of index and last time I said there were four, but the metadata for statistics is very similar, and we'll cover that next time. Our main table for indexes is [sys].[indexes] - with further information available in [sys].[xml_indexes] and [sys].[spatial_indexes]. For the columns participating in indexes, we will look at [sys].[index_columns].

So let's look at the most basic data first, from [sys].[indexes]:

object_idThis is the ID of the object that the index is created on.
nameThis is the name of the index. Each object cannot have more than index with the same name, but indexes on different objects can have the same names. If this is NULL, then the index represents a heap, which means the table has no clustered index - this is a universally bad thing.
index_idThis is the ID of the index - and is unique per object. However, the index ID also has special meaning:
  • 0 - Heap
  • 1 - Clustered index
  • 2+ - Nonclustered index
  • 256,000+ - XML index
  • 384,000+ - XML index
It is interesting to note these values - the numeric space given far exceeds the specified maximum numbers of indexes per object, and they are not even convenient bit masks.
typeThis column gives us an explicit type reference for the index:
  • 0 - Heap
  • 1 - Clustered
  • 2 - Nonclustered
  • 3 - XML
  • 4 - Spatial
type_descThis is the description that relates to the value in the type column.
is_uniqueThis will be 1 if the index enforces uniqueness, either because it was created using CREATE UNIQUE INDEX, or because it is enforcing a unique or primary key constraint.
data_space_idThis is the ID of the data space in which the data for the index is stored. Note that because heaps and clustered indexes actually store the data of the underlying object, these indexes tell us where the actual data is stored.
ignore_dup_keyThis is 1 if the IGNORE_DUP_KEY option is ON for the index.
is_primary_keyThis is 1 if the index was created to enforce a PRIMARY KEY constraint. These types of indexes cannot be created directly, they are created by using ALTER TABLE to create the relevant type of constraint.
is_unique_constraintThis is 1 if the index was created to enforce a UNIQUE constraint. These types of indexes cannot be created directly, they are created by using ALTER TABLE to create the relevant type of constraint.
fill_factorIf this is non-zero, then it represents the specified FILLFACTOR when the index was created or updated. Fill factor represents a percentage of how full an index page will be when the index is newly created or rebuilt. If this is zero, then the database default value is used. The database default value is typically 0, which, interestingly, means 100 (i.e. full pages), but can be changed using sp_configure.
is_paddedIf this is 1, then PAD_INDEX was specified for the index, which means that the intermediate or leaf nodes of the index will also be spaced according to the fill_factor value.
is_disabledIf this is 1 then the index is disabled.
is_hypotheticalIf this is 1, then the index is hypothetical, which means it can't actually be used for any useful querying. If you're wondering, hypothetical indexes are created by the database tuning advisor (DTA).
allow_row_locksThis is 1 if the index allows row level locks.
allow_page_locksThis is 1 if the index allows page level locks.
has_filterFrom SQL Server 2008 onwards, indexes could effectively have a WHERE clause specified for them, allowing indexes to be created on partial data. If the index has such a filter, then this column is 1.
filter_definitionIf the index has a filter, this contains the definition of it, otherwise this is NULL.

Ok, cooking with gas. We've got a lot of information above, which enables us to specify everything we need to in order to re-create or understand indexes, apart from the columns that are present. So, let's look at the [sys].[index_columns] table to look at what we get there:

object_idThis is the ID of the object that the index is created on.
index_idThis is the same as the index_id column from [sys].[indexes].
index_column_idThis is the ID of the column within in the index (i.e. giving a sequential number to the columns involved).
column_idThis is the ID of the indexed column, and can be matched with [sys].[columns] for any given object_id value.
key_ordinalThis gives the order in which the index key columns are specified - sorting the rows by this value gives you the key columns in the right order.
partition_ordinalThis gives you order in which the columns are involved in partitioning for the index. Because partition functions currently only accept one parameter, this is either 0 or 1 (i.e. is not or is the partitioning key), but support for multiple parameter partition functions could be included in the future.
is_descending_keyIf this is 1, then the index is arranged with this key descending.
is_included_columnIf this is 1, then the column is an included column, and is present only on the data page level of the index. Applies to non-clustered indexes only

That's given us everything we need to understand and re-create indexes. We can specify the key column list, with appropriate ordering, the inclusion list and even the partitioning key. So what extra information is available for XML indexes? Let's have a look at the extra columns we get in [sys].[xml_indexes] over and above what we get in [sys].[indexes]:

using_xml_index_idFor secondary XML indexes, a primary XML index is needed, and this column tells us what primary XML index the secondary index is referencing. For primary XML indexes, this column will contain NULL.
secondary_typeThis tells us the type of secondary XML index, if it is a secondary XML index:
  • P - PATH secondary XML index
  • V - VALUE secondary XML index
  • R - PROPERTY secondary XML index
If this is a primary XML index, then this column contains NULL.
secondary_type_descThis is the description that relates to the value in the secondary_type column.

So, there's not actually that much extra there - but it does cover what we need to know in order to accurately represent and recreate XML indexes.

What about spatial indexes? They have a lot of extra syntax over and above normal index creation, so we'd expect to find a lot of extra information in [sys].[spatial_indexes] over and above what we get in [sys].[indexes]:

spatial_index_typeThis is the type of spacial index:
  • 1 - Geometric spatial index
  • 2 - Geographic spatial index
spatial_index_type_descThis is the description that relates to the value in the spatial_index_type_desc column.
tessellation_schemeThis is the style of the tessellation scheme, either GEOMETRY_GRID or GEOGRAPHY_GRID. GEOMETRY_GRID is a bounded grid in flat space, and GEOGRAPHY_GRID is an unbounded grid in curved space (accounting for the shape of the Earth).

Hang on a minute. That wasn't the tasty snack we were expecting! Where is the information about bounding boxes for geometry grids, and the grid densities? For some unfathomable reason, this information is stored in [sys].[spatial_index_tessellations] - let's take a look:

object_idThis is the ID of the object that the index is created on.
index_idThis is the same as the index_id column from [sys].[indexes].
tessellation_schemeThis is the style of the tessellation scheme, either GEOMETRY_GRID or GEOGRAPHY_GRID. GEOMETRY_GRID is a bounded grid in flat space, and GEOGRAPHY_GRID is an unbounded grid in curved space (accounting for the shape of the Earth).
bounding_box_xminWhen using a GEOMETRY_GRID, these four values will define the bounding box in which the spatial index is defined. When using a GEOGRAPHY_GRID, the four values will be NULL.
bounding_box_ymin
bounding_box_xmax
bounding_box_ymax
level_1_gridGrid density appropriate grid level, and is one of the following values:
  • 16 = 4 by 4 grid
  • 64 = 8 by 8 grid
  • 256 = 16 by 16 grid
level_2_grid
level_3_grid
level_4_grid
level_1_grid_descDescription of the grid density for the appropriate grid level, either LOW, MEDIUM or HIGH.
level_2_grid_desc
level_3_grid_desc
level_4_grid_desc
cells_per_objectThis is the number of cells per object, as specified in the index DML.

That's better - now we've got the information we need for spatial indexes too. Statistics up next!


Viewing all articles
Browse latest Browse all 10

Trending Articles