[MS-WSP]:

Windows Search Protocol

Intellectual Property Rights Notice for Open Specifications Documentation

Technical Documentation. Microsoft publishes Open Specifications documentation (“this documentation”) for protocols, file formats, data portability, computer languages, and standards support. Additionally, overview documents cover inter-protocol relationships and interactions.

Copyrights. This documentation is covered by Microsoft copyrights. Regardless of any other terms that are contained in the terms of use for the Microsoft website that hosts this documentation, you can make copies of it in order to develop implementations of the technologies that are described in this documentation and can distribute portions of it in your implementations that use these technologies or in your documentation as necessary to properly document the implementation. You can also distribute in your implementation, with or without modification, any schemas, IDLs, or code samples that are included in the documentation. This permission also applies to any documents that are referenced in the Open Specifications documentation.

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation.

Patents. Microsoft has patents that might cover your implementations of the technologies described in the Open Specifications documentation. Neither this notice nor Microsoft's delivery of this documentation grants any licenses under those patents or any other Microsoft patents. However, a given Open Specifications document might be covered by the Microsoft Open Specifications Promise or the Microsoft Community Promise. If you would prefer a written license, or if the technologies described in this documentation are not covered by the Open Specifications Promise or Community Promise, as applicable, patent licenses are available by contacting .

License Programs. To see all of the protocols in scope under a specific license program and the associated patents, visit the Patent Map.

Trademarks. The names of companies and products contained in this documentation might be covered by trademarks or similar intellectual property rights. This notice does not grant any licenses under those rights. For a list of Microsoft trademarks, visit

Fictitious Names. The example companies, organizations, products, domain names, email addresses, logos, people, places, and events that are depicted in this documentation are fictitious. No association with any real company, organization, product, domain name, email address, logo, person, place, or event is intended or should be inferred.

Reservation of Rights. All other rights are reserved, and this notice does not grant any rights other than as specifically described above, whether by implication, estoppel, or otherwise.

Tools. The Open Specifications documentation does not require the use of Microsoft programming tools or programming environments in order for you to develop an implementation. If you have access to Microsoft programming tools and environments, you are free to take advantage of them. Certain Open Specifications documents are intended for use in conjunction with publicly available standards specifications and network programming art and, as such, assume that the reader either is familiar with the aforementioned material or has immediate access to it.

Support. For questions and support, please contact .

Revision Summary

Date / Revision History / Revision Class / Comments
12/18/2006 / 0.1 / New / Version 0.1 release
3/2/2007 / 1.0 / Major / Version 1.0 release
4/3/2007 / 1.1 / Minor / Version 1.1 release
5/11/2007 / 1.2 / Minor / Version 1.2 release
6/1/2007 / 1.2.1 / Editorial / Changed language and formatting in the technical content.
7/3/2007 / 1.3 / Minor / Minor technical content changes.
7/20/2007 / 1.3.1 / Editorial / Changed language and formatting in the technical content.
8/10/2007 / 1.4 / Minor / Clarified the meaning of the technical content.
9/28/2007 / 1.5 / Minor / Revised technical and editorial content based on feedback.
10/23/2007 / 1.5.1 / Editorial / Changed language and formatting in the technical content.
11/30/2007 / 2.0 / Major / Revised technical content based on feedback.
1/25/2008 / 2.1 / Minor / Clarified the meaning of the technical content.
3/14/2008 / 2.1.1 / Editorial / Changed language and formatting in the technical content.
5/16/2008 / 2.1.2 / Editorial / Changed language and formatting in the technical content.
6/20/2008 / 3.0 / Major / Updated and revised the technical content.
7/25/2008 / 4.0 / Major / Updated and revised the technical content.
8/29/2008 / 5.0 / Major / Updated and revised the technical content.
10/24/2008 / 6.0 / Major / Updated and revised the technical content.
12/5/2008 / 7.0 / Major / Updated and revised the technical content.
1/16/2009 / 8.0 / Major / Updated and revised the technical content.
2/27/2009 / 9.0 / Major / Updated and revised the technical content.
4/10/2009 / 10.0 / Major / Updated and revised the technical content.
5/22/2009 / 11.0 / Major / Updated and revised the technical content.
7/2/2009 / 12.0 / Major / Updated and revised the technical content.
8/14/2009 / 13.0 / Major / Updated and revised the technical content.
9/25/2009 / 14.0 / Major / Updated and revised the technical content.
11/6/2009 / 15.0 / Major / Updated and revised the technical content.
12/18/2009 / 16.0 / Major / Updated and revised the technical content.
1/29/2010 / 17.0 / Major / Updated and revised the technical content.
3/12/2010 / 18.0 / Major / Updated and revised the technical content.
4/23/2010 / 19.0 / Major / Updated and revised the technical content.
6/4/2010 / 20.0 / Major / Updated and revised the technical content.
7/16/2010 / 21.0 / Major / Updated and revised the technical content.
8/27/2010 / 21.0 / None / No changes to the meaning, language, or formatting of the technical content.
10/8/2010 / 21.0 / None / No changes to the meaning, language, or formatting of the technical content.
11/19/2010 / 21.1 / Minor / Clarified the meaning of the technical content.
1/7/2011 / 22.0 / Major / Updated and revised the technical content.
2/11/2011 / 23.0 / Major / Updated and revised the technical content.
3/25/2011 / 23.0 / None / No changes to the meaning, language, or formatting of the technical content.
5/6/2011 / 24.0 / Major / Updated and revised the technical content.
6/17/2011 / 25.0 / Major / Updated and revised the technical content.
9/23/2011 / 25.0 / None / No changes to the meaning, language, or formatting of the technical content.
12/16/2011 / 26.0 / Major / Updated and revised the technical content.
3/30/2012 / 26.0 / None / No changes to the meaning, language, or formatting of the technical content.
7/12/2012 / 26.1 / Minor / Clarified the meaning of the technical content.
10/25/2012 / 26.1 / None / No changes to the meaning, language, or formatting of the technical content.
1/31/2013 / 26.1 / None / No changes to the meaning, language, or formatting of the technical content.
8/8/2013 / 27.0 / Major / Updated and revised the technical content.
11/14/2013 / 27.0 / None / No changes to the meaning, language, or formatting of the technical content.
2/13/2014 / 27.0 / None / No changes to the meaning, language, or formatting of the technical content.
5/15/2014 / 27.0 / None / No changes to the meaning, language, or formatting of the technical content.
6/30/2015 / 28.0 / Major / Significantly changed the technical content.
10/16/2015 / 29.0 / Major / Significantly changed the technical content.
7/14/2016 / 30.0 / Major / Significantly changed the technical content.
6/1/2017 / 30.0 / None / No changes to the meaning, language, or formatting of the technical content.

Table of Contents

1Introduction

1.1Glossary

1.2References

1.2.1Normative References

1.2.2Informative References

1.3Overview

1.3.1Remote Administration Tasks

1.3.2Remote Querying

1.4Relationship to Other Protocols

1.5Prerequisites/Preconditions

1.6Applicability Statement

1.7Versioning and Capability Negotiation

1.8Vendor-Extensible Fields

1.8.1Property IDs

1.9Standards Assignments

2Messages

2.1Transport

2.2Message Syntax

2.2.1Structures

2.2.1.1CBaseStorageVariant

2.2.1.1.1CBaseStorageVariant Structures

2.2.1.1.1.1DECIMAL

2.2.1.1.1.2VT_VECTOR

2.2.1.1.1.3SAFEARRAY

2.2.1.1.1.4SAFEARRAYBOUND

2.2.1.1.1.5SAFEARRAY2

2.2.1.1.1.6VT_COMPRESSED_LPWSTR

2.2.1.2CFullPropSpec

2.2.1.3CContentRestriction

2.2.1.4CInternalPropertyRestriction

2.2.1.5CNatLanguageRestriction

2.2.1.6CNodeRestriction

2.2.1.7CPropertyRestriction

2.2.1.8CReuseWhere

2.2.1.9CScopeRestriction

2.2.1.10CSort

2.2.1.11CVectorRestriction

2.2.1.12CCoercionRestriction

2.2.1.13CRelDocRestriction

2.2.1.14CProbRestriction

2.2.1.15CFeedbackRestriction

2.2.1.16CRestrictionArray

2.2.1.17CRestriction

2.2.1.18CColumnSet

2.2.1.19CCategorizationSet

2.2.1.20CCategorizationSpec

2.2.1.21CCategSpec

2.2.1.22CRangeCategSpec

2.2.1.23RANGEBOUNDARY

2.2.1.24CAggregSet

2.2.1.25CAggregSpec

2.2.1.26CSortAggregSet

2.2.1.27CAggregSortKey

2.2.1.28CInGroupSortAggregSets

2.2.1.29CInGroupSortAggregSet

2.2.1.30CDbColId

2.2.1.31CDbProp

2.2.1.31.1Database Properties

2.2.1.32CDbPropSet

2.2.1.33CPidMapper

2.2.1.34CColumnGroupArray

2.2.1.35CColumnGroup

2.2.1.36SProperty

2.2.1.37CRowSeekAt

2.2.1.38CRowSeekAtRatio

2.2.1.39CRowSeekByBookmark

2.2.1.40CRowSeekNext

2.2.1.41CRowsetProperties

2.2.1.42CTableVariant

2.2.1.43CSortSet

2.2.1.44CTableColumn

2.2.1.45SERIALIZEDPROPERTYVALUE

2.2.1.46CCompletionCategSpec

2.2.2Message Headers

2.2.3Messages

2.2.3.1CPMCiStateInOut

2.2.3.2CPMConnectIn

2.2.3.3CPMConnectOut

2.2.3.4CPMCreateQueryIn

2.2.3.5CPMCreateQueryOut

2.2.3.6CPMGetQueryStatusIn

2.2.3.7CPMGetQueryStatusOut

2.2.3.8CPMGetQueryStatusExIn

2.2.3.9CPMGetQueryStatusExOut

2.2.3.10CPMSetBindingsIn

2.2.3.11CPMGetRowsIn

2.2.3.12CPMGetRowsOut

2.2.3.13CPMRatioFinishedIn

2.2.3.14CPMRatioFinishedOut

2.2.3.15CPMFetchValueIn

2.2.3.16CPMFetchValueOut

2.2.3.17CPMGetNotify

2.2.3.18CPMSendNotifyOut

2.2.3.19CPMGetApproximatePositionIn

2.2.3.20CPMGetApproximatePositionOut

2.2.3.21CPMCompareBmkIn

2.2.3.22CPMCompareBmkOut

2.2.3.23CPMRestartPositionIn

2.2.3.24CPMFreeCursorIn

2.2.3.25CPMFreeCursorOut

2.2.3.26CPMDisconnect

2.2.3.27CPMFindIndicesIn

2.2.3.28CPMFindIndicesOut

2.2.3.29CPMGetRowsetNotifyIn

2.2.3.30CPMGetRowsetNotifyOut

2.2.3.31CPMSetScopePrioritizationIn

2.2.3.32CPMSetScopePrioritizationOut

2.2.3.33CPMGetScopeStatisticsIn

2.2.3.34CPMGetScopeStatisticsOut

2.2.4Errors

2.2.5Standard Properties

2.2.5.1Query Properties

2.2.5.2Common Open Properties

3Protocol Details

3.1Server Details

3.1.1Abstract Data Model

3.1.2Timers

3.1.3Initialization

3.1.4Higher-Layer Triggered Events

3.1.5Processing and Sequencing Rules

3.1.5.1Remote Windows Search Service Catalog Management

3.1.5.1.1Receiving a CPMCiStateInOut Request

3.1.5.2Remote Windows Search Service Querying

3.1.5.2.1Receiving a CPMConnectIn Request

3.1.5.2.2Receiving a CPMCreateQueryIn Request

3.1.5.2.3Receiving a CPMGetQueryStatusIn Request

3.1.5.2.4Receiving a CPMGetQueryStatusExIn Request

3.1.5.2.5Receiving a CPMRatioFinishedIn Request

3.1.5.2.6Receiving a CPMGetRowsIn Request

3.1.5.2.7Receiving a CPMFetchValueIn Request

3.1.5.2.8Receiving a CPMSetBindingsIn Request

3.1.5.2.9Receiving a CPMGetNotify Request

3.1.5.2.10Receiving a CPMGetApproximatePositionIn Request

3.1.5.2.11Receiving a CPMCompareBmkIn Request

3.1.5.2.12Receiving a CPMRestartPositionIn Request

3.1.5.2.13Receiving a CPMFreeCursorIn Request

3.1.5.2.14Receiving a CPMDisconnect Request

3.1.5.2.15Receiving a CPMFindIndicesIn Request

3.1.5.2.16Receiving a CPMGetRowsetNotifyIn

3.1.5.2.17Receiving a CPMGetScopeStatisticsIn

3.1.5.2.18Receiving a CPMSetScopePrioritizationIn

3.1.6Timer Events

3.1.7Other Local Events

3.2Client Details

3.2.1Abstract Data Model

3.2.2Timers

3.2.3Initialization

3.2.4Higher-Layer Triggered Events

3.2.4.1Remote Windows Search Service Catalog Management

3.2.4.1.1Sending a CPMCiStateInOut Request

3.2.4.2Remote Windows Search Service Catalog Query Messages

3.2.4.2.1Sending a CPMConnectIn Request

3.2.4.2.2Sending a CPMCreateQueryIn Request

3.2.4.2.3Sending a CPMSetBindingsIn Request

3.2.4.2.4Sending a CPMGetRowsIn Request

3.2.4.2.5Sending a CPMFetchValueIn Request

3.2.4.2.6Sending a CPMFreeCursorIn Request

3.2.4.2.7Sending a CPMDisconnect Message

3.2.4.2.8Sending a CPMFindIndicesIn Request

3.2.4.2.9Sending a CPMGetRowsetNotifyIn Request

3.2.4.2.10Sending a CPMGetScopeStatisticsIn Request

3.2.4.2.11Sending a CPMSetScopePrioritizationIn Request

3.2.5Processing and Sequencing Rules

3.2.5.1Receiving a CPMCreateQueryOut Response

3.2.5.2Receiving a CPMGetRowsOut Response

3.2.5.3Receiving a CPMFetchValueOut Response

3.2.5.4Receiving a CPMFreeCursorOut Response

3.2.5.5Receiving a CPMFindIndicesOut Response

3.2.5.6Receiving a CPMGetRowsetNotifyOut Response

3.2.5.7Receiving a CPMGetScopeStatisticsOut Response

3.2.5.8Receiving a CPMSetScopePrioritizationOut Response

3.2.6Timer Events

3.2.7Other Local Events

4Protocol Examples

4.1Example 1

5Security

5.1Security Considerations for Implementers

5.2Index of Security Parameters

6Appendix A: Product Behavior

7Change Tracking

8Index

1Introduction

This document specifies the Windows Search Protocol, which allows a client to issue queries to a server hosting a Generic Search service (GSS). The protocol is primarily intended to be used for full-text queries. It also allows an administrator to remotely manage the GSS.

This document specifies both remote querying and remote administration of GSS catalogs.

Sections 1.5, 1.8, 1.9, 2, and 3 of this specification are normative. All other sections and examples in this specification are informative.

1.1Glossary

This document uses the following terms:

binding: A request to include a particular column in a returned rowset. The binding specifies a property to be included in the search results.

bookmark: A marker that uniquely identifies a row within a set of rows.

catalog: The highest-level unit of organization in the Windows Search service. It represents a set of indexed documents against which queries can be executed by using the [MS-WSP].

category: A hierarchical grouping of rows. For example, a query result that contains author and title columns can be categorized based on author. Each group of rows containing the same value for author would constitute a category.

chapter: A range of rows within a set of rows.

column: The container for a single type of information in a row. Columns map to property names and specify what properties are used for the search query's command tree elements.

command tree: A combination of restrictions and sort orders that are specified for a search query.

cursor: An entity that is used as a mechanism to work with one row or a small block of rows (at one time) in a set of data returned in a result set. A cursor is positioned on a single row within the result set. After the cursor is positioned on a row, operations can be performed on that row or on a block of rows starting at that position.

Generic Search Service (GSS): A service that implements the back-end database functionality needed to provide results to search queries. The Windows Search Service is a Generic Search Service instance. The abstract interfaces that need to be implemented by a Generic Search Service are described in Local Events. These interfaces are called by the server in order to obtain appropriate responses to Windows Search Protocol messages.

Generic Security Services (GSS): An Internet standard, as described in [RFC2743], for providing security services to applications. It consists of an application programming interface (GSS-API) set, as well as standards that describe the structure of the security data.

globally unique identifier (GUID): A term used interchangeably with universally unique identifier (UUID) in Microsoft protocol technical documents (TDs). Interchanging the usage of these terms does not imply or require a specific algorithm or mechanism to generate the value. Specifically, the use of this term does not imply or require that the algorithms described in [RFC4122] or [C706] must be used for generating the GUID. See also universally unique identifier (UUID).

handle: A token that can be used to identify and access cursors, chapters, and bookmarks.

hierarchical group coordinate: A coordinate that defines the position of a result in a hierarchical grouping result set. The coordinate is a list of non-negative integers, one per category as defined in the CPMCreateQueryIn message and CCategorizationSet structure, followed by another positive integer representing the position of the result in the last (deepest) category.

HRESULT: An integer value that indicates the result or status of an operation. A particular HRESULT can have different meanings depending on the protocol using it. See [MS-ERREF] section 2.1 and specific protocol documents for further details.

indexing: The process of extracting text or properties from files and storing the extracted values in an index or property cache.

inverted index: A persistent structure that contains the text content pulled out of files during indexing. The text in an inverted index maps from a word in a property to a list of the documents and locations within a document that contain that word.

locale: An identifier, as specified in [MS-LCID], that specifies preferences related to language. These preferences indicate how dates and times are to be formatted, how items are to be sorted alphabetically, how strings are to be compared, and so on.

named pipe: A named, one-way, or duplex pipe for communication between a pipe server and one or more pipe clients.

natural language query: A query constructed using human language instead of query syntax. The generic search service (GSS) is free to interpret the query in order to determine the best results. The interpretation is explicitly not specified in order to allow improvements over time.

noise word: A word that is ignored by the Windows Search service (WSS) when present in the restrictions specified for the search query, because it has little discriminatory value. English examples include "a," "and," and "the." Implementers of a generic search service (GSS) MAY choose to follow this guideline.

object: A file, email, email attachment, contact, calendar appointment or any other self-contained item that can be indexed and searched for by the GSS.

path: When referring to a file path on a file system, a hierarchical sequence of folders. When referring to a connection to a storage device, a connection through which a machine can communicate with the storage device.

property cache: A cache of file or object properties extracted during indexing.

restriction: A set of conditions that a file must meet to be included in the search results returned by the Generic Search Service (GSS) in response to a search query. A restriction narrows the focus of a search query, limiting the files that the Generic Search Service (GSS) will include in the search results only to those files matching the conditions.

row: The collection of columns containing the property values that describe a single result from the set of objects that matched the restrictions specified in the search query submitted to the Generic Search Service (GSS).

rowset: A set of rows returned in the search results.

sort order: A set of rules in a search query that defines the ordering of rows in the search result. Each rule consists of a managed property, such as modified date or size, and a direction for order, such as ascending or descending. Multiple rules are applied sequentially.

Unicode: A character encoding standard developed by the Unicode Consortium that represents almost all of the written languages of the world. The Unicode standard [UNICODE5.0.0/2007] provides three forms (UTF-8, UTF-16, and UTF-32) and seven schemes (UTF-8, UTF-16, UTF-16 BE, UTF-16 LE, UTF-32, UTF-32 LE, and UTF-32 BE).

WHEREID: A 32-bit integer that uniquely identifies the query restriction. Cannot be equal to 0xFFFFFFFF.

Windows Search service (WSS): A service that creates indexedcatalogs for the contents and properties of file systems. Applications can search the catalogsfor information from the files on the indexed file system.

MAY, SHOULD, MUST, SHOULD NOT, MUST NOT: These terms (in all caps) are used as defined in [RFC2119]. All statements of optional behavior use either MAY, SHOULD, or SHOULD NOT.

1.2References

Links to a document in the Microsoft Open Specifications library point to the correct section in the most recently published version of the referenced document. However, because individual documents in the library are not updated at the same time, the section numbers in the documents may not match. You can confirm the correct section numbering by checking the Errata.

1.2.1Normative References

We conduct frequent surveys of the normative references to assure their continued availability. If you have any issue with finding a normative reference, please contact . We will assist you in finding the relevant information.

[IEEE754] IEEE, "IEEE Standard for Binary Floating-Point Arithmetic", IEEE 754-1985, October 1985,

[MS-DTYP] Microsoft Corporation, "Windows Data Types".

[MS-ERREF] Microsoft Corporation, "Windows Error Codes".

[MS-LCID] Microsoft Corporation, "Windows Language Code Identifier (LCID) Reference".

[MS-SMB2] Microsoft Corporation, "Server Message Block (SMB) Protocol Versions 2 and 3".

[MS-SMB] Microsoft Corporation, "Server Message Block (SMB) Protocol".

[RFC2119] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, March 1997,

[SALTON] Salton, G., "Automatic Text Processing: The Transformation Analysis and Retrieval of Information by Computer", 1988, ISBN: 0201122278.

[UNICODE] The Unicode Consortium, "The Unicode Consortium Home Page",

1.2.2Informative References

[Jones] Sparck Jones, K., Walker, S., and Robertson, S.E., "A Probabilistic Model of Information and Retrieval: Development and Status", September 1998, University of Cambridge Technical Report UCAM-CL-TR-446.

[MSDN-FULLPROPSPEC] Microsoft Corporation, "FULLPROPSPEC structure",

[MSDN-OLEDBP] Microsoft Corporation, "OLE DB Provider for Indexing Service",

[MSDN-PROPLIST] Microsoft Corporation, "Windows Properties",

1.3Overview

The WSS helps to efficiently organize the extracted features of a collection of documents. The Windows Search Protocol allows a client to communicate with a server hosting a GSS, both to issue queries and to allow an administrator to manage the indexing server.

When processing files, the WSS analyzes a set of documents, extracts useful information, and then organizes the extracted information in such a way that properties of those documents can be efficiently returned in response to queries. A collection of documents that can be queried comprises a catalog. A catalog can contain an inverted index (for quick word matching) and a property cache (for quick retrieval of property values).

Conceptually, a catalog consists of a logical "table" of properties with the text or value and corresponding locale stored in columns of the table. Each row of the table corresponds to a separate document in the scope of the catalog, and each column of the table corresponds to a property.

The specific tasks performed by the Windows Search Protocol are grouped into two functional areas: