Managing Indexes

Administrator’s Guide
Red Hat Directory Server

Index

Chapter 10

Managing Indexes

The Red Hat Directory Server Deployment Guide introduced the concept of indexing, the costs and benefits, and different types of index shipped with Red Hat Directory Server (Directory Server). This chapter begins with a description of the searching algorithm itself, so as to place the indexing mechanism in context, and then describes how to create, delete, and manage indexes. This chapter contains the following sections:

About Indexes (page 391)
Creating Indexes (page 400)
Deleting Indexes (page 410)
Managing Indexes (page 418)
Attribute Name Quick Reference Table (page 422)

About Indexes

This section provides an overview of indexing in Directory Server. It contains the following topics:

About Index Types
About Default, System, and Standard Indexes
Overview of the Searching Algorithm
Balancing the Benefits of Indexing

About Index Types

Indexes are stored in files in the directory's databases. The names of the files are based on the indexed attribute, not the type of index contained in the file. Each index file may contain multiple types of indexes if multiple indexes are maintained for the specific attribute. For example, all indexes maintained for the common name attribute are contained in the cn.db3 file.

Directory Server supports the following types of index:

Presence index (pres) - The presence index contains a list of the entries that contain a particular attribute. This index is useful if, for example, you want to examine any entries that contain access control information. Generating an aci.db3 file that includes a presence index lets you efficiently perform the search for ACI=* to generate the Access Control List for the server.

The presence index is not used for base object searches.

Equality index (eq) - The equality index allows you to search efficiently for entries containing a specific attribute value. For example, an equality index on the cn attribute allows a user to perform the search for cn=Babs Jensen far more efficiently.
Approximate index (approx) - The approximate index allows efficient approximate or "sounds-like" searches. For example, an entry may include the attribute value cn=Robert E Lee. An approximate search would return this value for searches against cn~=Robert Lee, cn~=Robert, or cn~=Lee. Similarly, a search against l~=San Fransisco (note the misspelling) would return entries including l=San Francisco.
Substring index (sub) - The substring index is a costly index to maintain, but it allows efficient searching against substrings within entries. Substring indexes are limited to a minimum of three characters for each entry.

For example, searches of the form:
cn=*derson
 
would match the common names containing strings such as:
Bill Anderson

Jill Henderson

Steve Sanderson
 
Similarly, the search for:
telephonenumber= *555*
 
would return all the entries in your directory with telephone numbers that contain 555.

International index - The international index speeds up searches for information in international directories. The process for creating an international index is similar to the process for creating regular indexes, except that you apply a matching rule by associating a locale (OID) with the attributes to be indexed.

For a listing of supported locales and their associated OIDs, refer to Appendix D, "Internationalization." If you want to configure the Directory Server to accept additional matching rules, contact Red Hat Professional Services.

Browsing (virtual list view) index - The browsing index, or virtual list view index, speeds up the display of entries in the Directory Server Console. This index is particularly useful if a branch of your directory contains hundreds of entries; for example, the ou=people branch. You can create a browsing index on any branchpoint in the directory tree to improve display performance. You do this through the Directory Server Console or by using the vlvindex command-line tool, which is explained in the Red Hat Directory Server Configuration, Command, and File Reference.

About Default, System, and Standard Indexes

When you install Directory Server, a set of default and system indexes is created per database instance. To maintain these indexes, the directory uses standard indexes.

Overview of Default Indexes

The default indexes can be modified depending on your indexing needs, although you should ensure that no server plug-ins or other servers in your enterprise depend on this index before you remove it.

Table 10-1 lists the default indexes installed with the directory.

Table 10-1 Default Indexes

Attribute

Eq

Pres

Sub

Purpose

cn

X

X

X

Improves the performance of the most common types of user directory searches.

givenName

X

X

X

Improves the performance of the most common types of user directory searches.

mail

X

X

X

Improves the performance of the most common types of user directory searches.

mailHost

X

-

-

Used by a messaging server.

member

X

-

-

Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.

owner

X

-

-

Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.

seeAlso

X

-

-

Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.

sn

X

X

X

Improves the performance of the most common types of user directory searches.

telephoneNumber

X

X

X

Improves the performance of the most common types of user directory searches.

uid

X

-

-

Improves Directory Server performance.

uniquemember

X

-

-

Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.

Overview of System Indexes

System indexes are indexes that cannot be deleted or modified. They are required by the directory to function properly. Table 10-2 lists the system indexes included with the directory.

Table 10-2 System Indexes

Attribute

Eq

Pres

Purpose

aci

-

X

Allows the Directory Server to quickly obtain the access control information maintained in the database.

dnComp

X

-

Used to help accelerate subtree searches in the directory.

objectClass

X

-

Used to help accelerate subtree searches in the directory.

entryDN

X

-

Speeds up entry retrieval based on DN searches.

parentID

X

-

Enhances directory performance during one-level searches.

numSubordinates

-

X

Used by the Directory Server Console to enhance display performance on the Directory tab.

nsUniqueID

X

-

Used to search for specific entries.

Overview of Standard Indexes

Because of the need to maintain default indexes and other internal indexing mechanisms, the Directory Server also maintains certain standard index files. The following standard indexes exist by default, and you do not need to generate them:

id2entry.db4 - Contains the actual directory database entries. All other database files can be recreated from this one.
id2children.db4 - Restricts the scope of one-level searches, searches that examine an entry's immediate children.
dn.db4 - Controls the scope of subtree searches; searches that examine an entry and all the entries in the subtree beneath it.
dn2id.db4 - Begins all searches efficiently by mapping an entry's distinguished name to its ID number.

Overview of the Searching Algorithm

Indexes are used to speed up searches. To understand how the directory uses indexes, it helps to understand the searching algorithm. Each index contains a list of attributes (such as the cn, common name, attribute) and a pointer to the entries corresponding to each value. Directory Server processes a search request as follows:

An LDAP client application, such as the Directory Server Gateway, sends a search request to the directory.
The directory examines the incoming request to make sure that the specified base DN matches a suffix contained by one or more of its databases or database links.

If they do match, the directory processes the request.
If they do not match, the directory returns an error to the client indicating that the suffix does not match. If a referral has been specified in the nsslapd-referral attribute under cn=config, the directory also returns the LDAP URL where the client can attempt to pursue the request.

If the search request for each database attribute can be satisfied by a single index, then the server reads that index to generate a list of potential matches.

If there is no index for the attribute, the directory generates a candidate list that includes all entries in the database, which makes the search considerably slower.

If a search request contains multiple attributes, the directory consults multiple indexes and then combines the resulting lists of candidate entries.

If there is an index for the attribute, the directory takes the candidate matches from the index files in the form of a series of entry ID numbers.
The directory uses the returned entry ID numbers to read the corresponding entries from the id2entry.db3 file. The Directory Server then examines each of the candidate entries to see if any match the search criteria. The directory returns matching entries to the client as each is found.

The directory continues until either it has examined all candidate entries or it reaches the limit set in one of the following attributes:

nsslapd-sizelimit which specifies the maximum number of entries to return from a search operation. If this limit is reached, the directory returns any entries it has located that match the search request, as well as an exceeded size limit error.
nsslapd-timelimit which specifies the maximum number of seconds allocated for a search request. If this limit is reached, the directory returns any entries it has located that match the search request, as well as an exceeded time limit error.
nsslapd-lookthroughlimit which specifies the maximum number of entries that the directory will check when examining candidate entries in response to a search request.
nsslapd-idlistscanlimit which specifies the maximum number of entries in an ID list before the list is considered to equal the entire database.

See Red Hat Directory Server Configuration, Command, and File Reference for further information about these attributes.

In addition, the directory uses a variation of the metaphone phonetic algorithm to perform searches on an approximate index. Each value is treated as a sequence of words, and a phonetic code is generated for each word.

Note

The metaphone phonetic algorithm in Directory Server supports only US-ASCII letters. Therefore, use approximate indexing only with English values.

Values entered on an approximate search are similarly translated into a sequence of phonetic codes. An entry is considered to match a query if both of the following are true:

All of the query string codes match the codes generated in the entry string.
All of the query string codes are in the same order as the entry string codes.

For example:

Name in the Directory (Phonetic Code)

Query String (Phonetic code)

Match Comments

Alice B Sarette
(ALS B SRT)

Alice Sarette
(ALS SRT)

Matches. Codes are specified in the correct order.

Alice Sarrette
(ALS SRT)

Matches. Codes are specified in the correct order, despite the misspelling of Sarette.

Surette
(SRT)

Matches. The generated code exists in the original name, despite the misspelling of Sarette.

Bertha Sarette
(BR0 SRT)

No match. The code BR0 does not exist in the original name.

Sarette, Alice
(SRT ALS)

No match. The codes are not specified in the correct order.

Balancing the Benefits of Indexing

Before you create new indexes, balance the benefits of maintaining indexes against the costs. Keep in mind that:

Approximate indexes are not efficient for attributes commonly containing numbers, such as telephone numbers.
Substring indexes do not work for binary attributes.
Equality indexes should be avoided if the value is big (such as attributes intended to contain photographs or passwords containing encrypted data).
Maintaining indexes for attributes not commonly used in a search increases overhead without improving global searching performance.
Attributes that are not indexed can still be specified in search requests, although the search performance may be degraded significantly, depending on the type of search.
The more indexes you maintain, the more disk space you will require.

The following example illustrates exactly how time-consuming indexes can become. Consider the procedure for creating a specific attribute:

The Directory Server receives an add or modify operation.
The Directory Server examines the indexing attributes to determine whether an index is maintained for the attribute values.
If the created attribute values are indexed, then the Directory Server generates the new index entries.
Once the server completes the indexing, the actual attribute values are created according to the client request.

For example, suppose the Directory Server is asked to add the entry
dn: cn=John Doe, ou=People,dc=example,dc=com

objectclass: top

objectClass: person

objectClass: orgperson

objectClass: inetorgperson

cn: John Doe

cn: John

sn: Doe

ou: Manufacturing

ou: people

telephonenumber: 408 555 8834

description: Manufacturing lead for the Z238 line of widgets.
 
Further suppose that the Directory Server is maintaining the following indexes:

Equality, approximate, and substring indexes for common name and surname attributes.
Equality and substring indexes for the telephone number attribute.
Substring indexes for the description attribute.

Then to add this entry to the directory, the Directory Server must perform these steps:

Create the common name equality index entry for John and John Doe.
Create the appropriate common name approximate index entries for John and John Doe.
Create the appropriate common name substring index entries for John and John Doe.
Create the surname equality index entry for Doe.
Create the appropriate surname approximate index entry for Doe.
Create the appropriate surname substring index entries for Doe.
Create the telephonenumber equality index entry for 408 555 8834.
Create the appropriate telephonenumber substring index entries for 408 555 8834.
Create the appropriate description substring index entries for Manufacturing lead for the Z238 line of widgets. A large number of substring entries are generated for this string.

The example shows that indexing can be costly.

Creating Indexes

This section describes how to create presence, equality, approximate, substring, and international indexes for specific attributes using the Directory Server Console and the command-line.

Note

Given that this version of Directory Server can operate in either a single or multi-database environment, you need to remember to create your new indexes in every database instance since newly created indexes are not automatically created in the other databases.

However, the same is not entirely true for default indexes because they are automatically present and maintained in subsequent database instances but not added to existing ones. In other words, the directory uses your most recently created set of default indexes in subsequent databases. This means that if you add a default index to your second database instance, it will not be maintained in your first database instance but will be maintained in any subsequent instances.

As the procedure for creating browsing indexes is different, it is covered in a separate section.

This section contains the following procedures:

Creating Indexes from the Server Console
Creating Indexes from the Command-Line
Creating VLV Indexes from the Server Console
Creating VLV Indexes from the Command-Line

Creating Indexes from the Server Console

Using the Directory Server Console, you can create presence, equality, approximate, substring, and international indexes for specific attributes.

To create indexes:

In the Directory Server Console, select the Configuration tab.
Expand the Data node, expand the suffix of the database you want to index, and select the database.
Select the Indexes tab in the right pane.

Note

Do not click on the Database Settings node because this will take you to the Default Index Settings window and not the window for configuring indexes per database.

If the attribute you want to index is listed in the Additional Indexes table, skip to Step 6. Otherwise, click Add Attribute.

A dialog box appears containing a list of all of the available attributes in the server schema.

Select the attribute you want to index, and click OK.

The server adds the attribute to the Additional Indexes table.

Select the checkbox for each type of index you want to maintain for each attribute.
If you want to create an index for a language other than English, enter the OID of the collation order you want to use in the Matching Rules field.

You can index the attribute using multiple languages by listing multiple OIDs separated by commas (but no whitespace). For a list of languages, their associated OIDs, and further information regarding collation orders, see Appendix D, "Internationalization."

Click Save.

The Indexes dialog box appears, displaying the status of the index creation and informing you when the indexes have been created. You can click on the Status Logs box to view the status of the indexes created. Once the indexing is complete, click Close to close the Indexes dialog box.

The new index is immediately active for any new data that you add and any existing data in your directory. You do not have to restart your server.

Creating Indexes from the Command-Line

You can create presence, equality, approximate, substring, and international indexes for specific attributes from the command-line.

Creating indexes from the command-line involves two steps:

Using the ldapmodify command-line utility to add a new index entry or edit an existing index entry.
Running the db2index.pl Perl script to generate the new set of indexes to be maintained by the server.

Note

You cannot create new system indexes because system indexes are hard-coded in Directory Server.

The following sections describe the steps involved in creating indexes.

Adding an Index Entry

Use ldapmodify to add the new index attributes to your directory. If you want to create a new index that will become one of the default indexes, add the new index attributes to the cn=default indexes,cn=config,cn=ldbm database, cn=plugins,cn=config entry.

To create a new index for a particular database, add it to the cn=index,cn=database_name,cn=ldbm database,cn=plugins,cn=config entry, where cn=database_name corresponds to the name of the database.

Note

Avoid creating entries under cn=config in the dse.ldif file. The cn=config entry in the simple, flat dse.ldif configuration file is not stored in the same highly scalable database as regular entries. As a result, if many entries, particularly entries that are likely to be updated frequently, are stored under cn=config, performance will probably suffer.

Although we recommend you do not store simple user entries under cn=config for performance reasons, it can be useful to store special user entries such as the Directory Manager entry or Replication Manager (supplier bind DN) entry under cn=config since this allows you to centralize configuration information.

For information on the LDIF update statements required to add entries, see "LDIF Update Statements," on page 65.

For example, assume you want to create presence, equality, and substring indexes for the sn (surname) attribute in the Example1 database.

First, type the following to change to the directory containing the utility:
cd serverRoot/shared/bin
 
Run the ldapmodify command-line utility as follows:
ldapmodify -a -h server -p 389 -D "cn=directory manager" -w 
password
 
The ldapmodify utility binds to the server and prepares it to add an entry to the configuration file.

Next, you add the following entry for the new indexes:
dn: cn=sn,cn=index,cn=Example1,cn=ldbm 
database,cn=plugins,cn=config

objectClass:top

objectClass:nsIndex

cn:sn

nsSystemIndex:false

nsIndexType:pres

nsIndexType:eq

nsIndexType:sub

nsMatchingRule:2.16.840.1.113730.3.3.2.3.1
 
The cn attribute contains the name of the attribute you want to index, in this example, the sn attribute. The entry is a member of the nsIndex object class. The nsSystemIndex attribute is false, indicating that the index is not essential to Directory Server operations. The multi-valued nsIndexType attribute specifies the presence (pres), equality (eq) and substring (sub) indexes. Each keyword has to be entered on a separate line. The nsMatchingRule attribute specifies the OID of the Bulgarian collation order.

Specifying an index entry with no value in the nsIndexType attribute results in all indexes (except international) being maintained for the specified attribute. For example, suppose instead that you specify the following entry for your new sn indexes:
dn: cn=sn,cn=index,cn=database_name,cn=ldbm 
database,cn=plugins,cn=config

objectClass:top

objectClass:nsIndex

cn:sn

nsSystemIndex:false

nsIndexType:
 
This new entry results in all indexes for the sn (surname) attribute.

You can use the keyword none in the nsIndexType attribute to specify that no indexes are to be maintained for the attribute. For example, suppose you want to temporarily disable the sn indexes you just created on the Example1 database. You change the nsIndexType to none as follows:
dn: cn=sn,cn=index,cn=Example1,cn=ldbm 
database,cn=plugins,cn=config

objectClass:top

objectClass:nsIndex

cn:sn

nsSystemIndex:false

nsIndexType:none
 
For a complete list of collation orders and their OIDs, see Appendix D, "Internationalization."

For more information about the index configuration attributes, see the Red Hat Directory Server Configuration, Command, and File Reference.

For more information about the ldapmodify command-line utility, refer to the Red Hat Directory Server Configuration, Command, and File Reference.

Note

You should always use the attribute's primary name (not the attribute's alias) when creating indexes. The primary name of the attribute is the first name listed for the attribute in the schema; for example, uid for the userid attribute. See Table 10-3, on page 422,, for a list of all primary and alias attribute names.

Running the db2index.pl Script

Once you have created an indexing entry or added additional index types to an existing indexing entry, run the db2index.pl script to generate the new set of indexes to be maintained by the Directory Server. Once you run the script, the new set of indexes is active for any new data you add to your directory and any existing data in your directory.

To run the db2index.pl Perl script:

From the command-line, change to the following directory:
serverRoot/slapd-serverID/
 
Run the db2index.pl Perl script.

For more information about using this Perl script, refer to Red Hat Directory Server Configuration, Command, and File Reference.

This example generates an index using the db2index.pl UNIX shell script:
db2index.pl -D "cn=Directory Manager" -w passsword  -n 
ExampleServer -t sn
 
The following table describes the db2index.pl options used in the examples:

Option

Description

-D

Specifies the DN of the administrative user.

-w

Specifies the password of the administrative user.

-n

Specifies the name of the database into which you are importing the data.

-t

Specifies the name of the attribute contained by the index.

For more information about the db2index.pl Perl script, see the Red Hat Directory Server Configuration, Command, and File Reference.

Creating VLV Indexes from the Server Console

A virtual list view (VLV) index is a way of creating a truncated list for faster searching while enhancing server performance. The VLV index itself can be resource-intensive to maintain, but it can be beneficial in large directories (over 1000 entries).

To create a VLV index from the Console:

In the Directory Server Console, select the Directory tab.
In the left navigation tree, create a suffix where you want to create the VLV index. For instance, for views based on the locality (l) attribute, name this organizational unit Location Views.
Right-click on ou=Location Views, and select New>Other.
Select nsview from the New Object menu, and hit okay.
In the Property Editor window, hit the "Add Value" button, and add the organization unit object class.
Name the organization unit according to how you want to organize the VLVs you will create. For instance, ou=Sunnyvale. Make the ou attribute the naming attribute.
Hit the "Add Attribute" button, and add the nsviewfilter attribute.
Create a filter that reflects the views you will create. For example:
(l=Sunnyvale)
 
Hit okay to close the attributes box, and hit okay again to save the new VLV index.

The new index is immediately active for any new data that you add to your directory. You do not have to restart your server.

For more information on how to change the VLV search information or the access control rules that are set by default for VLV searches, see "Adding a Browsing Index Entry," on page 407, and "Setting Access Control for VLV Information," on page 410.

A browsing index is a type of VLV index that organizes the entries listed into alphabetical order, making it easier to find entries. To create a browsing index using the Directory Server Console:

In the Directory Server Console, select the Directory tab.
In the left navigation tree (for example, People), select the entry for which you want to create the index.
From the Object menu, select Create Browsing Index.

The Create Browsing Index dialog box appears displaying the status of the index creation. You can click on the Status Logs box to view the status of the indexes created.

Click Close to close the Create Browsing Index dialog box.

The new index is immediately active for any new data that you add to your directory. You do not have to restart your server.

Creating VLV Indexes from the Command-Line

Creating a browsing index or virtual list view (VLV) index from the command-line involves these steps:

Using ldapmodify to add new browsing index entries or edit existing browsing index entries.
Running the vlvindex script to generate the new set of browsing indexes to be maintained by the server.
Ensuring that access control on VLV index information is set appropriately.

The following sections describe the steps involved in creating browsing indexes.

Adding a Browsing Index Entry

The type of browsing index entry you want to create depends on the type of ldapsearch attribute sorting you want to accelerate. It is important to take the following into account:

The scope of the search (base, one, sub).

For more information on the ldapsearch -s option, which allows you to specify the scope of searches, see the Red Hat Directory Server Configuration, Command, and File Reference.

The base of the search (the entry you want to use as a starting point for the search).

For more information on the ldapsearch -b option, which allows you to specify the base of searches, see the Red Hat Directory Server Configuration, Command, and File Reference.

The attributes you want to sort.
The filter of the search.

For more information on specifying filters for searches, see Appendix B, "Finding Directory Entries."

The ldbm database to which the entry that forms the base of the search belongs. You can only create browsing indexes in ldbm databases.

For example, you want to create a browsing index to accelerate an ldapsearch on the entry ou=People,dc=example,dc=com held in the Example1 database where:

the search base is ou=People,dc=example,dc=com,
the search filter is (|(objectclass=*)(objectclass=ldapsubentry)),
the scope is one, and
the sorting order for the returned attributes is cn, givenname, o, ou, and sn.

First, type the following to change to the directory containing the utility:
cd serverRoot/shared/bin
 
Run the ldapmodify command-line utility as follows:
ldapmodify -a -h server -p 389 -D "cn=directory manager" -w 
password
 
The ldapmodify utility binds to the server and prepares it to add an entry to the configuration file.

Next, you need to add two browsing index entries which define your browsing index.

The first entry you add specifies the base, scope, and filter of the browsing index:
dn: cn=MCC ou=People  dc=example dc=com, cn=userRoot, cn=ldbm 
database, cn=plugins, cn=config

objectClass: top

objectClass: vlvSearch

cn: MCC ou=People  dc=example dc=com

vlvBase: ou=People, dc=example,dc=com

vlvScope: 1

vlvFilter: (|(objectclass=*)(objectclass=ldapsubentry))
 
The cn contains the browsing index identifier, which specifies the entry on which you want to create the browsing index; in this example, the ou=People,dc=example,dc=com entry. We recommend you use the dn of the entry for your browsing index identifier, which is the approach adopted by the Directory Server Console, to prevent identical browsing indexes from being created. The entry is a member of the vlvSearch object class.

The vlvbase attribute value specifies the entry on which you want to create the browsing index; in this example, the ou=People,dc=example,dc=com entry (the browsing index identifier).

The vlvscope attribute is 1, indicating that the base for the search you want to accelerate is 1. A search base of 1 means that only the immediate children of the entry specified in the cn attribute, and not the entry itself, will be searched.

The vlvfilter specifies the filter to be used for the search; in this example, (|(objectclass=*)(objectclass=ldapsubentry)).

The second entry you add specifies the sorting order you want for the returned attributes:
dn: cn=by MCC ou=People dc=example dc=com,cn=MCC ou=People  
dc=example dc=com, cn=userRoot, cn=ldbm database, cn=plugins, 
cn= config

objectClass: top

objectClass: vlvIndex

cn: by MCC ou=People  dc=example dc=com

vlvSort: cn givenname o ou sn
 
The cn contains the browsing index sort identifier. The above cn is the type created by the Console by default, which has the sorting order as being set "by" the browsing index base. The entry is a member of the vlvIndex object class.

The vlvsort attribute value specifies the order in which you want your attributes to be sorted; in this example, cn, givenname, o, ou, and then sn.

Note

This first browsing index entry must be added to the cn=database_name,cn=ldbm database,cn=plugins,cn=config directory tree node, and the second entry must be a child of the first entry.

Running the vlvindex Script

Once you have created the two browsing indexing entries or added additional attribute types to an existing indexing browsing entries, run the vlvindex script to generate the new set of browsing indexes to be maintained by the Directory Server. After you run the script, the new set of browsing indexes is active for any new data you add to your directory and any existing data in your directory.

To run the vlvindex script:

From the command-line, change to the following directory:
serverRoot/slapd-serverID/
 
Stop the server:
./stop-slapd
 
Run the vlvindex script.

For more information about using this script, refer to Red Hat Directory Server Configuration, Command, and File Reference.

Restart the server.
./start-slapd
 
This example generates a browsing index using the vlvindex UNIX shell script:
vlvindex -n Example1 -T "by MCC ou=people dc=example dc=com"
 
The following table describes the vlvindex options used in the examples:

Option

Description

-n

Name of the database containing the entries to index.

-T

Browsing index identifier to use to create browsing indexes.

For more information about the vlvindex script, see the Red Hat Directory Server Configuration, Command, and File Reference.

Setting Access Control for VLV Information

The default access control for the VLV index information is to allow anyone who has authenticated. If a site requires anonymous users to use the VLV index information, modify the access control set for cn: VLV Request Control in the Directory Server's configuration.

Change to the configuration directory: serverRoot/slapd-serverID/config
In a text editor, open the dse.ldif file.
Locate oid=2.16.840.1.113730.3.4.9; you should see these lines:

dn: oid=2.16.840.1.113730.3.4.9,cn=features,cn=config
objectClass: top
objectClass: directoryServerFeature
oid: 2.16.840.1.113730.3.4.9
cn: VLV Request Control
aci: (targetattr != "aci")(version 3.0; acl "VLV Request Control"; allow( read, search, compare, proxy ) userdn = "ldap:///all" ;)
creatorsName: cn=server,cn=plugins,cn=config
modifiersName: cn=server,cn=plugins,cn=config
...

Change "ldap://all" to "ldap://anyone" and save your changes.

Deleting Indexes

This section describes how to delete presence, equality, approximate, substring, international, and browsing indexes for specific attributes.

Note

Because this version of Directory Server can operate in either a single or multi-database environment, you have to delete any unwanted indexes from every database instance. Any default indexes you delete will not be deleted from previous sets of indexes on existing database instances.

As the procedure for deleting browsing indexes is different, it is covered in a separate section. This section contains the following procedures:

Deleting Indexes from the Server Console
Deleting Indexes from the Command-Line
Deleting Browsing and VLV Indexes from the Server Console
Deleting Browsing and VLV Indexes from the Command-Line

Caution

You must not delete system indexes because deleting them can significantly affect Directory Server performance. System indexes are located in the cn=index,cn=instance,cn=ldbm database,cn=plugins,cn=config entry and the cn=default indexes,cn=config,cn=ldbm database,cn=plugins,cn=config entry.

Take care when deleting default indexes since this can also affect how Directory Server works.

For further information on system and default indexes, see the Red Hat Directory Server Deployment Guide.

Deleting Indexes from the Server Console

Using the Directory Server Console you can delete indexes you have created, indexes used by other server applications such as a messaging or web server, and default indexes. You cannot delete system indexes.

To delete indexes using the Directory Server Console:

In the Directory Server Console, select the Configuration tab.
Expand the Data node and expand the suffix associated with the database containing the index. Select the database from which you want to delete the index.
Locate the attribute containing the index you want to delete. Clear the checkbox under the index.

If you want to delete all indexes maintained for a particular attribute, select the attribute's cell under Attribute Name, and click Delete Attribute.

Click Save.

A Delete Index warning dialog box appears asking you to confirm that you want to delete the index.

Click Yes to delete the index.
The Delete Browsing Index dialog box appears displaying the status of the index deletion. You can click on the Status Logs button to view the status of the indexes deleted.
Once the indexing is complete, click on Close to close the Delete Browsing Index box.

Deleting Indexes from the Command-Line

You can browsing indexes, or virtual list view (VLV) indexes, using the ldapdelete command-line utility as follows:

Delete an entire index entry or delete unwanted index types from an existing index entry using the ldapdelete command-line utility.
Generate the new set of indexes to be maintained by the server using the db2index.pl script.

The following sections describe the steps involved in deleting an index.

Deleting an Index Entry

Use the ldapdelete command-line utility to delete either the entire indexing entry or the unwanted index types from an existing entry.

If you want to delete the indexes for a particular database, you remove your index entry from the cn=index,cn=database_name,cn=ldbm database,cn=plugins,cn=config entry, where cn=database_name corresponds to the name of the database.

To delete a default index, remove it from the cn=default indexes,cn=config,cn=ldbm database,cn=plugins,cn=config entry.

For example, you want to delete presence, equality, and substring indexes for the sn attribute on the database named Example1.

You want to delete the following entry:
dn: cn=sn,cn=index,cn=Example1,cn=ldbm 
database,cn=plugins,cn=config

objectClass:top

objectClass:nsIndex

cn:sn

nsSystemIndex:false

nsIndexType:pres

nsIndexType:eq

nsIndexType:sub

nsMatchingRule:2.16.840.1.113730.3.3.2.3.1
 
To run the ldapdelete command-line utility, type the following to change to the directory containing the utility:
cd serverRoot/shared/bin
 
Perform the ldapdelete as follows:
ldapdelete -D "cn=Directory Manager" -w password -h ExampleServer 
-p845 "cn=sn,cn=index,cn=Example1,dn=ldbm database, 
cn=plugins,dn=config"
 
The following table describes the ldapdelete options used in the example:

Option

Description

-D

Specifies the distinguished name with which to authenticate to the server. The value must be a DN recognized by the Directory Server, and it must also have the authority to modify the entries.

-w

Specifies the password associated with the distinguished name specified in the -D option.

-h

Specifies the name of the host on which the server is running.

-p

Specifies the port number that the server uses.
For full information on ldapdelete options, refer to the Red Hat Directory Server 
Configuration, Command, and File Reference.
 
Once you have deleted this entry, the presence, equality, and substring indexes for the sn attribute will no longer be maintained by the Example1 database.

Running the db2index.pl Script

Once you have deleted an indexing entry or deleted some of the index types from an indexing entry, run the db2index.pl script to generate the new set of indexes to be maintained by the Directory Server. Once you run the script, the new set of indexes is active for any new data you add to your directory and any existing data in your directory.

To run the db2index.pl Perl script:

From the command-line, change to the following directory:
serverRoot/slapd-serverID/
 
Run the db2index.pl Perl script.

For more information about using the db2index.pl Perl script, refer to Red Hat Directory Server Configuration, Command, and File Reference.

This example generates a new set of indexes to be maintained by the server, using the db2index.pl UNIX shell script:
db2index.pl -D "cn=Directory Manager" -w password -n Example1
 
The following table describes the db2index.pl options used in the examples:

Option

Description

-D

Specifies the DN of the administrative user.

-w

Specifies the password of the administrative user.

-n

Specifies the name of the database into which you are importing the data.

For more information about the db2index.pl Perl script, see the Red Hat Directory Server Configuration, Command, and File Reference.

Deleting Browsing and VLV Indexes from the Server Console

Using the Directory Server Console, you can delete browsing indexes and VLV indexes.

To delete a virtual list:

In the Directory Server Console, select the Directory tab.
Select the virtual list that you want to delete, such as ou=Sunnyvale,ou=LocationViews,dc=example,dc=com (to delete all the virtual lists, you can delete the entire subsuffix, ou=LocationViews,dc=example,dc=com).
Right-click on the entry, and select Delete from the drop-down menu. You can also select the entry, and select Object>Delete from the tool menu at the top.
A dialog box appears asking you to confirm that you want to delete the entry. Click Yes to delete.

To delete a browsing index:

In the Directory Server Console, select the Directory tab.
Select the entry from which you want to delete the index in the navigation tree, and select Delete Browsing Index from the Object menu. You can also select and right-click the entry of the index you want to delete in the navigation tree and then choose Delete Browsing Index from the pop-up menu.
A Delete Browsing Index dialog box appears asking you to confirm that you want to delete the index. Click Yes to delete.
The Delete Browsing Index dialog box appears displaying the status of the index deletion.

Deleting Browsing and VLV Indexes from the Command-Line

Deleting a browsing index or virtual list view (VLV) index from the command-line involves two steps:

Using the ldapdelete to delete browsing index entries or edit existing browsing index entries.
Running the vlvindex script to generate the new set of browsing indexes to be maintained by the server.

The actual entries for an alphabetical browsing index and virtual list view are the same. The following sections describe the steps involved in deleting browsing indexes.

Deleting a Browsing Index Entry

Use the ldapdelete command-line utility to either delete browsing indexing entries or edit existing browsing index entries.

To delete browsing indexes for a particular database, you remove your browsing index entries from the cn=index,cn=database_name,cn=ldbm database,cn=plugins,cn=config entry, where cn=database_name corresponds to the name of the database.

For example, you want to delete a browsing index for accelerating ldapsearch operations on the entry ou=People,dc=example,dc=com held in the Example1 database where the search base is ou=People,dc=example,dc=com, the search filter is (|(objectclass=*)(objectclass=ldapsubentry)), the scope is 1, and the sorting order for the returned attributes is cn, givenname, o, ou, and sn.

To delete this browsing index, you need to delete the two corresponding browsing index entries which follow:
dn: cn=MCC ou=People  dc=example dc=com, cn=userRoot, cn=ldbm 
database, cn=plugins, cn=config

objectClass: top

objectClass: vlvSearch

cn: MCC ou=People  dc=example dc=com

vlvBase: ou=People, dc=example,dc=com

vlvScope: 1

vlvFilter: (|(objectclass=*)(objectclass=ldapsubentry))
 
and
dn: cn=by MCC ou=People  dc=example dc=com,cn=MCC ou=People  
dc=example dc=com, cn=userRoot, cn=ldbm database, cn=plugins, 
cn= config

objectClass: top

objectClass: vlvIndex

cn: by MCC ou=People  dc=example dc=com

vlvSort: cn givenname o ou sn
 
To run the ldapdelete command-line utility, type the following to change to the directory containing the utility:
cd serverRoot/shared/bin
 
Perform the ldapdelete as follows:
ldapdelete -D "cn=Directory Manager" -w password -h ExampleServer 
-p 845 "cn=MCC ou=People  dc=example dc=com, cn=userRoot, cn=ldbm 
database, cn=plugins, cn=config"

"cn=by MCC ou=People  dc=example dc=com,cn=MCC ou=People  
dc=example dc=com, cn=userRoot, cn=ldbm database, cn=plugins, cn= 
config"
 
The following table describes the ldapdelete options used in the example:

Option

Description

-D

Specifies the distinguished name with which to authenticate to the server. The value must be a DN recognized by the Directory Server, and it must also have the authority to modify the entries.

-w

Specifies the password associated with the distinguished name specified in the -D option.

-h

Specifies the name of the host on which the server is running.

-p

Specifies the port number that the server uses.

For full information on ldapdelete options, refer to the Red Hat Directory Server Configuration, Command, and File Reference.

Once you have deleted these two browsing index entries, the browsing index for accelerating ldapsearch operations on the entry ou=People dc=example dc=com held in the Example1 database where the search base is ou=People,dc=example,dc=com, the search filter is (|(objectclass=*)(objectclass=ldapsubentry)), the scope is 1, and the sorting order for the returned attributes is cn, givenname, o, ou, and sn will no longer be maintained by the Example1 database.

Running the vlvindex Script

Once you have deleted browsing indexing entries or deleted unwanted attribute types from existing browsing indexing entries, run the vlvindex script to generate the new set of browsing indexes to be maintained by the Directory Server. Once you run the script, the new set of browsing indexes is active for any new data you add to your directory and any existing data in your directory.

To run the vlvindex script:

From the command-line, change to the following directory:
serverRoot/slapd-serverID/
 
Stop the server:
./stop-slapd
 
Run the vlvindex script.

For more information about using the vlvindex script, refer to Red Hat Directory Server Configuration, Command, and File Reference.

Restart the server:
./start-slapd
 
This example recreates the indexes using the vlvindex UNIX shell script:
vlvindex -n Example1 -T "by MCC ou=people dc=example dc=com"
 
The following table describes the vlvindex options used in the examples.

Option

Description

-n

Name of the database containing the entries to index.

-T

Browsing index identifier to use to create browsing indexes.

For more information about the vlvindex script, see the Red Hat Directory Server Configuration, Command, and File Reference.

Managing Indexes

Each index that the directory uses is composed of a table of index keys and matching entry ID lists. This entry ID list is used by the directory to build a list of candidate entries that may match a client application's search request (see "About Indexes," on page 391, for details).

The redesigned secondary index structure greatly improves write, search, and indexing operations. The following sections examine enhanced indexing operations and the BerkeleyDB design, searching and the old All IDs Threshold, and migrating and compatibility with previous versions of Directory Server.

Indexing Performance

While achieving extremely high read performance, in previous versions of Directory Server, write performance was limited by the number of bytes per second that could be written into the storage manager's transaction log file. Large log files were generated for each LDAP write operation; in fact, "log file verbosity" could easily be 100 times the corresponding number of bytes changed in the Directory Server. The majority of the contents in the log files are related to index changes (ID insert and delete operations).

The secondary index structure was separated into two levels in the old design:

The ID list structures, which were the province of the Directory Server backend and opaque to the storage manager.
The storage manager structures (Btrees), which were opaque to the Directory Server backend code.

Because it had no insight into the internal structure of the ID lists, the storage manager had to treat ID lists as opaque byte arrays. From the storage manager's perspective, when the content of an ID list changed, the entire list had changed. For a single ID that was inserted or deleted from an ID list, the corresponding number of bytes written to the transaction log was the maximum configured size for that ID list, about 8Kbytes. Also, every database page on which the list was stored was marked as dirty, since the "entire" list had changed.

In the redesigned index, the storage manager has visibility into the fine-grain index structure, which optimizes transaction logging so that only the number of bytes actually changed need to be logged for any given index modification. The BerkeleyDB feature provides ID list semantics, which are implemented by the storage manager. The Berkeley API was enhanced to support the insertion and deletion of individual IDs stored against a common key, with support for duplicate keys, and an optimized mechanism for the retrieval of the complete ID list for a given key.

The storage manager has direct knowledge of the application's intent when changes are made to ID lists. As a result:

For long ID lists, the number of bytes written to the transaction log for any update to the list is significantly reduced, from the maximum ID list size (8Kbytes) to twice the size of one ID (4bytes).
For short ID lists, storage efficiency, and in most cases performance, is improved because only the storage manager metadata need to be stored, not the ID list metadata.
The average number of database pages marked as dirty per ID insert or delete operation is very small because a large number of duplicate keys will fit into each database page.

Search Performance

For each entry ID list, there is a size limit that is globally applied to all index keys managed by the server. In previous versions of Directory Server, this limit was called the All IDs Threshold. Because maintaining large ID lists in memory can affect performance, the All IDs Threshold set a limit on how large a single entry ID list could get. When a list hit a certain pre-determined size, the search would treat it as if the index contained the entire directory.

The difficulty in setting the All IDs Threshold hurt peformance. If the threshold was too low, too many searches examined every entry in the directory. If it was too high, too many large ID lists had to be maintained in memory.

The problems addressed by the All IDs Threshold are no longer present because of the efficiency of entry insertion, modification, and deletion in the BerkeleyDB design. The All IDs Threshold is removed for database write operations, and every ID list is now maintained accurately.

Since loading a long ID list from the database can significantly reduce search performance, the configuration parameter, nsslapd-idlistscanlimit, sets a limit on the number of IDs that are read before a key is considered to match the entire primary index. nsslapd-idlistscanlimit is analagous to the All IDs Threshold, but it only applies to the behavior of the server's search code, not the content of the database.

idlistscanlimit

When the server uses indexes in the processing of a search operation, it is possible that one index key matches a large number of entries. For example, consider a search for objectclass=inetorgperson in a directory that contained one million inetorgperson entries. When the server reads the inetorgperson index key in the objectclass index, it would find one million matching entries. In cases like this, it is usually more efficient simply to conclude in the index lookup phase of the search operation processing that all the entries in the database match the query. This causes subsequent search processing to scan the entire database content, checking each entry as to whether it matches the search filter. The idea is that the time required to fetch the index keys is not worthwhile; the search operation is likely to be processed more efficiently by omitting the index lookup.

Directory Server implements this search optimization as follows: when examining an index, if more than a certain number of entries are found, the server will stop reading the index and mark the search as unindexed with respect to that particular index.

The threshold number of entries is called the idlistscanlimit and is configured with the nsslapd-idlistscanlimit configuration attribute. The default value is 4000, which is designed to give good performance for a common range of database sizes and access patterns. Typically, it is not necessary to change this value. However, in rare circumstances it may be possible to improve search performance with a different value. For example, lowering the value will improve performance for searches that will otherwise eventually hit the default limit of 4000. This might, of course, reduce performance for other searches that would benefit from indexing. Conversely, increasing the limit could improve performance for searches that were previously hitting the limit. With a higher limit, these searches could benefit from indexing where previously they did not.

For more information on search limits for the server, see "Overview of the Searching Algorithm," on page 395.

Backwards Compatibility and Migration

While current versions of Directory Server can support the old database design, only the new design is supported for this and later releases of Directory Server.

Upon startup, the server will read the database version from the DBVERSION file, which contains the text Netscape-ldbm/6.2 (old database version) and Netscape-ldbm/7.1 (new database format). If the file indicates that the old format is used, then the old code is selected for the database. Because the DBVERSION file stores everything per-backend, it is theoretically possible to have different database formats for different individual backends. However, the old database format is not recommended.

All databases must be migrated to Directory Server 7.1 when the system is upgraded. Migration is supported for Directory Server 6.x versions. Version 7.1 can be installed "over" the previous releases, resulting in a working server that supports the new database design. Migrating the databases is a lengthy operation; do not do this at the time of installation.

For releases earlier than version 6.11, it is recommended that the databases be dumped, and Directory Server installed fresh.

For more information on migrating databases, see chapter 6, "Migrating from Previous Versions," in the Red Hat Directory Server Installation Guide.

Also, the index sizes can be larger than in previous releases, so you may want to increase your database cache size. To reconfigure your cache size, refer to "nsslap-dbcachesize" in the Red Hat Directory Server Configuration, Command, and File Reference.

Attribute Name Quick Reference Table

Table 10-3 lists all attributes which have a primary or real name as well as an alias. When creating indexes be sure to use the primary name.

Table 10-3 Attribute Name Quick Reference Table

Attribute Primary Name

Attribute Alias

dn

distinguishedName

cn

commonName

sn

surName

c

countryName

l

localityName

st

stateOrProvinceName

street

streetAddress

o

organization

ou

organizationalUnitName

facsimileTelephoneNumber

fax

uid

userId

mail

rfc822mailbox

mobile

mobileTelephoneNumber

pager

pagerTelephoneNumber

co

friendlyCountryName

labeledUri

labeledUri

ttl

timeToLive

dc

domainComponent

authorCn

documentAuthorCommonName

authorSn

documentAuthorSurname

drink

favoriteDrink

Table 10-1 Default Indexes
Attribute	Eq	Pres	Sub	Purpose
cn	X	X	X	Improves the performance of the most common types of user directory searches.
givenName	X	X	X	Improves the performance of the most common types of user directory searches.
mail	X	X	X	Improves the performance of the most common types of user directory searches.
mailHost	X	-	-	Used by a messaging server.
member	X	-	-	Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.
owner	X	-	-	Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.
seeAlso	X	-	-	Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.
sn	X	X	X	Improves the performance of the most common types of user directory searches.
telephoneNumber	X	X	X	Improves the performance of the most common types of user directory searches.
uid	X	-	-	Improves Directory Server performance.
uniquemember	X	-	-	Improves Directory Server performance. This index is also used by the Referential Integrity Plug-in. See "Maintaining Referential Integrity," on page 75, for more information.

Table 10-2 System Indexes
Attribute	Eq	Pres	Purpose
aci	-	X	Allows the Directory Server to quickly obtain the access control information maintained in the database.
dnComp	X	-	Used to help accelerate subtree searches in the directory.
objectClass	X	-	Used to help accelerate subtree searches in the directory.
entryDN	X	-	Speeds up entry retrieval based on DN searches.
parentID	X	-	Enhances directory performance during one-level searches.
numSubordinates	-	X	Used by the Directory Server Console to enhance display performance on the Directory tab.
nsUniqueID	X	-	Used to search for specific entries.

Note	The metaphone phonetic algorithm in Directory Server supports only US-ASCII letters. Therefore, use approximate indexing only with English values.

Name in the Directory (Phonetic Code)	Query String (Phonetic code)	Match Comments
Alice B Sarette (ALS B SRT)	Alice Sarette (ALS SRT)	Matches. Codes are specified in the correct order.
	Alice Sarrette (ALS SRT)	Matches. Codes are specified in the correct order, despite the misspelling of Sarette.
	Surette (SRT)	Matches. The generated code exists in the original name, despite the misspelling of Sarette.
	Bertha Sarette (BR0 SRT)	No match. The code BR0 does not exist in the original name.
	Sarette, Alice (SRT ALS)	No match. The codes are not specified in the correct order.

Note	Given that this version of Directory Server can operate in either a single or multi-database environment, you need to remember to create your new indexes in every database instance since newly created indexes are not automatically created in the other databases. However, the same is not entirely true for default indexes because they are automatically present and maintained in subsequent database instances but not added to existing ones. In other words, the directory uses your most recently created set of default indexes in subsequent databases. This means that if you add a default index to your second database instance, it will not be maintained in your first database instance but will be maintained in any subsequent instances.

Note	Do not click on the Database Settings node because this will take you to the Default Index Settings window and not the window for configuring indexes per database.

Note	You cannot create new system indexes because system indexes are hard-coded in Directory Server.

Note	Avoid creating entries under cn=config in the dse.ldif file. The cn=config entry in the simple, flat dse.ldif configuration file is not stored in the same highly scalable database as regular entries. As a result, if many entries, particularly entries that are likely to be updated frequently, are stored under cn=config, performance will probably suffer. Although we recommend you do not store simple user entries under cn=config for performance reasons, it can be useful to store special user entries such as the Directory Manager entry or Replication Manager (supplier bind DN) entry under cn=config since this allows you to centralize configuration information.

Note	You should always use the attribute's primary name (not the attribute's alias) when creating indexes. The primary name of the attribute is the first name listed for the attribute in the schema; for example, uid for the userid attribute. See Table 10-3, on page 422,, for a list of all primary and alias attribute names.

Option	Description
-D	Specifies the DN of the administrative user.
-w	Specifies the password of the administrative user.
-n	Specifies the name of the database into which you are importing the data.
-t	Specifies the name of the attribute contained by the index.

Note	This first browsing index entry must be added to the cn=database_name,cn=ldbm database,cn=plugins,cn=config directory tree node, and the second entry must be a child of the first entry.

Option	Description
-n	Name of the database containing the entries to index.
-T	Browsing index identifier to use to create browsing indexes.

Note	Because this version of Directory Server can operate in either a single or multi-database environment, you have to delete any unwanted indexes from every database instance. Any default indexes you delete will not be deleted from previous sets of indexes on existing database instances.

Caution	You must not delete system indexes because deleting them can significantly affect Directory Server performance. System indexes are located in the cn=index,cn=instance,cn=ldbm database,cn=plugins,cn=config entry and the cn=default indexes,cn=config,cn=ldbm database,cn=plugins,cn=config entry. Take care when deleting default indexes since this can also affect how Directory Server works. For further information on system and default indexes, see the Red Hat Directory Server Deployment Guide.

Option	Description
-D	Specifies the distinguished name with which to authenticate to the server. The value must be a DN recognized by the Directory Server, and it must also have the authority to modify the entries.
-w	Specifies the password associated with the distinguished name specified in the -D option.
-h	Specifies the name of the host on which the server is running.
-p	Specifies the port number that the server uses.

Option	Description
-D	Specifies the DN of the administrative user.
-w	Specifies the password of the administrative user.
-n	Specifies the name of the database into which you are importing the data.

Option	Description
-D	Specifies the distinguished name with which to authenticate to the server. The value must be a DN recognized by the Directory Server, and it must also have the authority to modify the entries.
-w	Specifies the password associated with the distinguished name specified in the -D option.
-h	Specifies the name of the host on which the server is running.
-p	Specifies the port number that the server uses.

Option	Description
-n	Name of the database containing the entries to index.
-T	Browsing index identifier to use to create browsing indexes.

Table 10-3 Attribute Name Quick Reference Table
Attribute Primary Name	Attribute Alias
dn	distinguishedName
cn	commonName
sn	surName
c	countryName
l	localityName
st	stateOrProvinceName
street	streetAddress
o	organization
ou	organizationalUnitName
facsimileTelephoneNumber	fax
uid	userId
mail	rfc822mailbox
mobile	mobileTelephoneNumber
pager	pagerTelephoneNumber
co	friendlyCountryName
labeledUri	labeledUri
ttl	timeToLive
dc	domainComponent
authorCn	documentAuthorCommonName
authorSn	documentAuthorSurname
drink	favoriteDrink

Contents

Index