diff --git a/PrefixACL.md b/PrefixACL.md new file mode 100644 index 0000000..540e368 --- /dev/null +++ b/PrefixACL.md @@ -0,0 +1,79 @@ +--- +RFC: (PR number) +Status: (Change to Proposed when it's ready for review) +--- + +# Title +Security Model for Search Indexes + +## Abstract + +The existing Valkey ACL-based security model must be extended to cover the Search module's concept of an index which spans multiple keys. + +## Motivation + +Users that desire to enforce data visibility and immutability need a security model that's easy to understand and configure. + +## Design considerations + +The key design considerations are: + +1. Overall performance should be unaffected if ACLs are not being used. +2. The security model should be simple to understand and therefore be unlikely to cause accidental misconfigurations. +3. The security model should be simple to implement to reduce odds of a bug. + +## Specification + +ACLs have two separate portions. One portion is granting/inhibiting access to commands either individually as part of a category. +No changes to the mechanism are required and it will be extended to cover the new search module commands and the new @search category. + +The second portion of the ACL machinery enables/disable access subsets of the keyspace for a user. +The search module creates indexes which contain data from a subset of the keyspace that is defined by one or more key prefixes (see the definition of FT.CREATE in the search RFC). +In order to enforce the the ACL keyspace restrictions, a user may only access or otherwise use a search index +if that user has read access to 100% of the keys that _might_ be in the index. +In other words, if the user's ACL prohibits access to any key that might be present in the index (any current or any potential future key), then access is blocked. Thus the search module cannot be used to access the contents of any key that the user is prohibited from directly reading herself. + +The implementation of this model is very straightforward. +Each search command that has an index (FT.CREATE, FT.SEARCH, FT.AGGREGATE, FT.DROPINDEX, FT.INFO), performs a security check on each keyspace (prefix) that is part of the index definition. +The security check is to ask the question whether the current user has access to 100% of the keys that could begin with the specific prefix. +If any of those checks fail then the command fails with a security error. +This is done at the start of each and every search command. +Thus, consistent with the Valkey ACL usage, if an ACL for a user is modified, then the next command issued by that user will be subjected to the new definition. + +No security information is ever cached or retained within the search module itself. + +### Authentication and Authorization + +No changes to the existing ACL machinery are proposed. Rather as described above, the search module ensures that no search command can be used to obtain access to keys which are not directly readable by the current user. + +### Implementation + +The search module will implement two methods of performing the required prefix access checks. +There is no functional difference between the two methods the only difference is the performance of search commands. + +The first method will be used when the Valkey core does not export the Module API proposed below. +This mode enables the Search module ACL enforcement to be fully operable with the existing 7.x and 8.x Valkey releases. +This method implements the security check by performing a module CALL operation to fetch the ACL description of the current user and then to parse that result to determine the keyspace patterns enabled for the current user. + +The second method is used when the proposed module API below is present. In which case, the security check is done within the core in a more efficient manner. + +### Module API + +The following module API is proposed to be implemented in the next Valkey release: + +``` +enum KeyPrefixCheckFlags { + VALKEYMODULE_PREFIX_CHECK_READ_ACCESS = 1, +} + +int ValkeyModule_AclCheckKeyPrefixPermissions(ValkeyModuleContext *ctx, const char *prefix, size_t bytes, size_t flags) +``` + +The ```prefix``` parameter points to the byte string to check with its length specified by the ```bytes``` parameter. +The ```flags``` indicates the type of access for the check. Currently only one flag is defined and it must be specified. + +The return value indicates whether access is permitted (C_OK) or not (C_ERR) for the current user. +This API doesn't log anything or generate any security error message, that's the responsibility of the caller. + +The implementation of this command is straightforward. The prefix is matched against each key pattern in turn, if the pattern doesn't match the entire prefix (either with explicit characters or with globing) or if there are non-wildcard characters after the prefix then the match fails and C_ERR is returned. +Only if all of the patterns pass the test is C_OK returned. diff --git a/VSS.md b/VSS.md new file mode 100644 index 0000000..5a5ffc2 --- /dev/null +++ b/VSS.md @@ -0,0 +1,518 @@ +## RFC: 8 + +## Status: Proposed + + +# Valkey Search RFC + +## Abstract + +This proposed Valkey Search module will introduce search algorithms to provide numeric, tag and vector search capabilities on both Valkey Cluster and Valkey Standalone. With search on Valkey, users can take advantage of low latency in-memory search in their familiar Valkey environment. + +## Motivation + +With the rising popularity of generative AI, developers have turned to in-memory data stores such as Valkey or Redis for ultra-low latency (p99 single digit ms) vector search for use cases like Retrieval Augmented Generation (RAG), recommendation systems, semantic search, semantic caching, and more. For developers already using Valkey or Redis, co-locating their vectors and search functionality alongside their underlying data will accelerate and simplify adoption. And for other “greenfield” users, the ultra-low latency of Valkey search may attract them to replace their existing database for high performance use cases to achieve in-memory speeds. + +## Design considerations + +Here we list Critical User Journeys (CUJs) which we believe Valkey users will desire as part of a Search Module. + +**Search Operations CUJ** \- as a user, I want to perform search operations using commonly available clients. + +- Requirement: support index insertion/mutation/deletion/query operations +- Requirement: maintain compatibility with RediSearch VSS APIs, Memorystore APIs, and MemoryDB APIs as much as possible + +**Performance CUJ** \- As a user, I want to use in-memory search because it provides the lowest latency, and most performant option + +- Requirement: low latency data ingestion +- Requirement: low latency queries +- Requirement: high throughput: query \+ ingestion +- Requirement: high throughput at high levels of recall +- Requirement: \[Scale up\] multi-threading & vertical scaling (query \+ ingestion) +- Requirement: Negligible resource consumption (CPU, memory) for Valkey applications and operations/commands that do not use the search module. + +**API Compatibility CUJ** \- As a user, I want a compatible API to RediSearch VSS, which is supported by common clients. The goal is to enable applications to be portable between Redisearch and Valkey search with minimal effort. The search module will strive for the highest level of compatibility with Redisearch API for core application operations that are valid, deterministic and error-free. Other items, e.g., management or configuration commands, timing dependent operations, programmer development commands, status commands and/or error messages for invalid operations may differ substantially. + +Binary RDB and replication stream compatibility with Redisearch is not needed, meaning that online/live data migration will not be possible. + +**HASH and JSON Data Type support CUJ** \- As a user, I want to store my vectors in a HASH or JSON data type, which is natively supported in Valkey. + +**Scale Up & Out CUJ** \- As a user, I want to be able to both scale up and out + +- Requirement: multi-threading for scale up +- Requirement: Cluster for scale out & storing more data + +**Consistency CUJ** \- As a user, I want to be able to select a consistency model that matches my needs. + +- Requirement: A sequential consistency model must be supported. + +**Memory footprint CUJ** \- As a user, I want the index to consume memory based on usage and dynamically grow + +- Requirement: Lazy memory consumption based on the index entries and the dimensions. +- Requirement: Index size should be allowed to grow dynamically with minimal performance overhead + +**Replication CUJ** \- As a user, I want to rely on replication to provide high-availability and ensure that my data is protected by storing it across multiple nodes with low replication latency + +- Requirement: Support fast bootstrap +- Requirement: Support index full sync on replication lag. + +**Monitoring CUJ** \- As a user, I need to be able to monitor indexes and re-indexing with simple observability metrics + +- Requirement: Ability to monitor index base statistics +- Requirement: Ability to monitor cross indexes, aggregated, view statistics. + +**Backfill CUJ** \- As a user, I need to be able to create an index after the keyspace contains vector entries + +- Requirement: Index creation triggers a backfill process which indexes the existing keyspace entries. + +**Hybrid Query CUJ** \- As a user, I want to be able to perform hybrid queries to incorporate filtering criteria in addition to best-match vector search + +- Requirement: Support for filtering numeric and tag-based filters. +- Requirement: Support for boolean combinations of operators in combination with filtering (AND, OR, NOT) + +**Result Set Management CUJ** \- As a user I want to dictate which fields are returned in a search + +- Requirement: Support query controls to set the desired returned fields. + +**ACLs CUJ** \- As a user of Access Control Lists (ACLs), I want to manage user-based permissions, specific command permissions, etc to manage vector search access + +- Requirement: Existing ACls functionality should extend to vector search commands + +**Persistence CUJ** \- As a user I want to be able to leverage RDB and/or AOF for persisting and loading indexes + +- Requirement: Index metadata should be persisted leveraging RDB +- Requirement: Index types that take significant time to rebuild, such as vector indexes, must support persistence of the index payload. + +**Tuning CUJ** \- As a user who is also managing the infrastructure, I want to tune my configurations, like number of threads, to optimize my search performance + +- Requirement: Configs like number of threads should be exposed and tunable to users + +## Current Specification + +The search module allows a user to create any number of indexes. Each index consists of any number of typed fields and covers a specified subset of the keys within a Valkey database. + +The creation of an index initiates a backfill process in the background that will scan the keyspace and insert all covered keys into the newly created index. Mutations of a key are automatically reflected in the index. This process is automatic after the creation of the index. Mutation of keys is explicitly allowed during the backfill process. The current state of a backfill may be interrogated by a client using the FT.INFO command. + +A query operation is a two step process. In the first step, a query is executed that generates a set of keys and distances. In the second step, that list of keys and distances is used to generate a response. + +The first step is performed in a background thread, while the second step is performed by the mainthread. If a key produced by step 1 is mutated before step 2 is executed by the mainthread, the result of the query may not be consistent with the latest state of the keys. + +### Cluster Mode Operation + +In cluster mode, indexes are sharded just like data, meaning that each node maintains an index only for the keys that are present on that node. +This means for that for data mutation operations, no cross-cluster communication is required, each node simply updates it's local index to reflect the mutation of the local key. + +Query operations are accepted by any node in the cluster and that node is responsible for broadcasting the query across the cluster and merging the results for delivery to the client. +Cross-client communication uses gRPC and protobufs and does not require mainthread interaction. + +Similar to query operations, index metadata operations, i.e., `FT.CREATE` and `FT.DROPINDEX` are accepted by any node in the cluster and broadcast to every other node in the cluster using gRPC. +Eventual consistency of index metadata amongst the cluster nodes is maintained through the use of the cluster bus. +Periodically, a checksum of the index metadata is broadcast across the cluster such that inconsistent metadata can be detected and corrected. + +The `FT.INFO` command returns information only for the current node. + +## Current Restrictions + +- `FT.SEARCH` can not be executed within a LUA function. +- `FT.SEARCH` assumes that READONLY has been specified, meaning that broadcasts across the cluster may select either primaries or replicas for each shard. +- The RDB file format is not compatible with Redisearch +- ACLs are not supported + + +### Commands + +``` +FT.CREATE + ON HASH + [PREFIX [...]] + SCHEMA + ( + [AS ] + NUMERIC + | TAG [SEPARATOR ] [CASESENSITIVE] + | VECTOR [HNSW | FLAT] [ ]+) + )+ +``` + +The `FT.CREATE` command creates an empty index and initiates the backfill process. Each index consists of a number of field definitions. Each field definition specifies a field name, a field type and a path within each indexed key to locate a value of the declared type. Some field type definitions have additional sub-type specifiers. + +For indexes on HASH keys, the path is the same as the hash member name. The optional `AS` clause can be used to rename the field if desired. +Renaming of fields is especially useful when the member name contains special characters. + +For indexes on JSON keys, the path is a JSON path to the data of the declared type. Because the JSON path always contains special characters, the `AS` clause is required. + +- **\** (required): This is the name you give to your index. If an index with the same name exists already, an error is returned. + +- **ON HASH | JSON** (optional): Only keys that match the specified type are included into this index. If omitted, HASH is assumed. + +- **PREFIX \ \** (optional): If this clause is specified, then only keys that begin with the same bytes as one or more of the specified prefixes will be included into this index. If this clause is omitted, all keys of the correct type will be included. A zero-length prefix would also match all keys of the correct type. + +- Field types: + + - **TAG**: A tag field is a string that contains one or more tag values. + + - **SEPARATOR \** (optional): One of these characters `,.<>{}[]"':;!@#$%^&*()-+=~` used to delimit individual tags. If omitted the default value is `,`. + - **CASESENSITIVE** (optional): If present, tag comparisons will be case sensitive. The default is that tag comparisons are NOT case sensitive + + + + - **NUMERIC** Field contains a number. + + - **VECTOR**: Field contains a vector. Two vector indexing algorithms are currently supported: HNSW (Hierarchical Navigable Small World) and FLAT (brute force). Each algorithm has a set of additional attributes, some required and other optional. + + - **FLAT:** The Flat algorithm provides exact answers, but has runtime proportional to the number of indexed vectors and thus may not be appropriate for large data sets. + - **DIM \** (required): Specifies the number of dimensions in a vector. + - **TYPE FLOAT32** (required): Data type, currently only FLOAT32 is supported. + - **DISTANCE\_METRIC \[L2 | IP | COSINE\]** (required): Specifies the distance algorithm + - **INITIAL\_CAP \** (optional): Initial index size. + - **HNSW:** The HNSW algorithm provides approximate answers, but operates substantially faster than FLAT. + - **DIM \** (required): Specifies the number of dimensions in a vector. + - **TYPE FLOAT32** (required): Data type, currently only FLOAT32 is supported. + - **DISTANCE\_METRIC \[L2 | IP | COSINE\]** (required): Specifies the distance algorithm + - **INITIAL\_CAP \** (optional): Initial index size. + - **M \** (optional): Number of maximum allowed outgoing edges for each node in the graph in each layer. on layer zero the maximal number of outgoing edges will be 2\*M. Default is 16, the maximum is 512\. + - **EF\_CONSTRUCTION \** (optional): controls the number of vectors examined during index construction. Higher values for this parameter will improve recall ratio at the expense of longer index creation times. The default value is 200\. Maximum value is 4096\. + - **EF\_RUNTIME \** (optional): controls the number of vectors to be examined during a query operation. The default is 10, and the max is 4096\. You can set this parameter value for each query you run. Higher values increase query times, but improve query recall. + +**RESPONSE** OK or error. + +``` +FT.DROPINDEX +``` + +The specified index is deleted. It is an error if that index doesn't exist. + +- **\** (required): The name of the index to delete. + +**RESPONSE** OK or error. + +``` +FT.INFO +``` + +Detailed information about the specified index is returned. + +- **\** (required): The name of the index to return information about. + +**RESPONSE** + +An array of key value pairs. + +- **index\_name** (string) The index name +- **num\_docs** (integer) Total keys in the index +- **num\_records** (integer) Total records in the index +- **hash\_indexing\_failures** (integer) Count of unsuccessful indexing attempts +- **indexing** (integer) Binary value. Shows if background indexing is running or not +- **percent\_indexed** (integer) Progress of background indexing. Percentage is expressed as a value from 0 to 1 +- **index\_definition** (array) An array of values defining the index + - **key\_type** (string) HASH. This is the only available key type. + - **prefixes** (array of strings) Prefixes for keys + - **default\_score** (integer) This is the default scoring value for the vector search scoring function, which is used for sorting. + - **attributes** (array) One array of entries for each field defined in the index. + - **identifier** (string) field name + - **attribute** (string) An index field. This is correlated to a specific index HASH field. + - **type** (string) VECTOR. This is the only available type. + - **index** (array) Extended information about this internal index for this field. + - **capacity** (integer) The current capacity for the total number of vectors that the index can store. + - **dimensions** (integer) Dimension count + - **distance\_metric** (string) Possible values are L2, IP or Cosine + - **data\_type** (string) FLOAT32. This is the only available data type + - **algorithm** (array) Information about the algorithm for this field. + - **name** (string) HNSW or FLAT + - **m** (integer) The count of maximum permitted outgoing edges for each node in the graph in each layer. The maximum number of outgoing edges is 2\*M for layer 0\. The Default is 16\. The maximum is 512\. + - **ef\_construction** (integer) The count of vectors in the index. The default is 200, and the max is 4096\. Higher values increase the time needed to create indexes, but improve the recall ratio. + - **ef\_runtime** (integer) The count of vectors to be examined during a query operation. The default is 10, and the max is 4096\. + +``` +FT._LIST +``` + +Lists the currently defined indexes. + +**RESPONSE** + +An array of strings which are the currently defined index names. + +``` +FT.SEARCH + [NOCONTENT] + [TIMEOUT ] + [PARAMS nargs [ ...]] + [LIMIT ] + [DIALECT ] +``` + +Performs a search of the specified index. The keys which match the query expression are returned. + +- **\** (required): This index name you want to query. +- **\** (required): The query string, see below for details. +- **NOCONTENT** (optional): When present, only the resulting key names are returned, no key values are included. +- **TIMEOUT \** (optional): Lets you set a timeout value for the search command. This must be an integer in milliSeconds. +- **PARAMS \ \ \ \ \ ...** (optional): `count` is of the number of arguments, i.e., twice the number of value name pairs. See the query string for usage details. +- **RETURN \ \ \ ...** (options): `count` is the number of fields to return. Specifies the fields you want to retrieve from your documents, along with any aliases for the returned values. By default, all fields are returned unless the NOCONTENT option is set, in which case no fields are returned. If num is set to 0, it behaves the same as NOCONTENT. +- **LIMIT \ \** (optional): Lets you choose a portion of the result. The first `` keys are skipped and only a maximum of `` keys are included. The default is LIMIT 0 10, which returns at most 10 keys. +- **DIALECT \** (optional): Specifies your dialect. The only supported dialect is 2\. + +**RESPONSE** + +The command returns either an array if successful or an error. + +On success, the first entry in the response array represents the count of matching keys, followed by one array entry for each matching key. +Note that if the `LIMIT` option is specified it will only control the number of returned keys and will not affect the value of the first entry. + +When `NOCONTENT` is specified, each entry in the response contains only the matching keyname, Otherwise, each entry includes the matching keyname, followed by an array of the returned fields. + +The result fields for a key consists of a set of name/value pairs. The first name/value pair is for the distance computed. The name of this pair is constructed from the vector field name prepended with "\_\_" and appended with "\_score" and the value is the computed distance. The remaining name/value pairs are the members and values of the key as controlled by the `RETURN` clause. + +The query string conforms to this syntax: + +``` +=>[ KNN @ $ ] +``` + +Where: + +- **\** Is either a `*` or a filter expression. A `*` indicates no filtering and thus all vectors within the index are searched. A filter expression can be provided to designate a subset of the vectors to be searched. +- **\** The name of a vector field within the specified index. +- **\** The number of nearest neighbor vectors to return. +- **\** A PARAM name whose corresponding value provides the query vector for the KNN algorithm. Note that this parameter must be encoded as a 32-bit IEEE 754 binary floating point in little-endian format. +- **\** (Optional) A list of keyword/value pairs that modify this particular KNN search. Currently two keywords are supported: + - **EF_RUNTIME** This keyword is accompanied by an integer value which overrides the default value of **EF_RUNTIME** specified when the index was created. + - **AS** This keyword is accompanied by a string value which becomes the name of the score field in the result, overriding the default score field name generation algorithm. + +**Filter Expression** + +A filter expression is constructed as a logical combination of Tag and Numeric search operators contained within parenthesis. + +**Tag** + +The tag search operator is specified with one or more strings separated by the `|` character. A key will satisfy the Tag search operator if the indicated field contains any one of the specified strings. + +``` +@:{} +or +@:{ | } +or +@:{ | | ...} +``` + +For example, the following query will return documents with blue OR black OR green color. + +`@color:{blue | black | green}` + +As another example, the following query will return documents containing "hello world" or "hello universe" + +`@color:{hello world | hello universe}` + +**Numeric Range** + +Numeric range operator allows for filtering queries to only return values that are in between a given start and end value. Both inclusive and exclusive range queries are supported. For simple relational comparisons, \+inf, \-inf can be used with a range query. + +The syntax for a range search operator is: + +``` +@:[ [(] [(] ] +``` + +where \ is either a number or \+inf or \-inf + +Bounds without a leading open paren are inclusive, whereas bounds with the leading open paren are exclusive. + +Use the following table as a guide for mapping mathematical expressions to filtering queries: + +``` +min <= field <= max @field:[min max] +min < field <= max @field:[(min max] +min <= field < max @field:[min (max] +min < field < max @field:[(min (max] +field >= min @field:[min +inf] +field > min @field:[(min +inf] +field <= max @field:[-inf max] +field < max @field:[-inf (max] +field == val @field:[val val] +``` + +**Logical Operators** + +Multiple tags and numeric search operators can be used to construct complex queries using logical operators. + +**Logical AND** + +To set a logical AND, use a space between the predicates. For example: + +``` +query1 query2 query3 +``` + +**Logical OR** + +To set a logical OR, use the `|` character between the predicates. For example: + +``` +query1 | query2 | query3 +``` + +**Logical Negation** + +Any query can be negated by prepending the `-` character before each query. Negative queries return all entries that don't match the query. This also includes keys that don't have the field. + +For example, a negative query on @genre:{comedy} will return all books that are not comedy AND all books that don't have a genre field. + +The following query will return all books with "comedy" genre that are not published between 2015 and 2024, or that have no year field: + +@genre:\[comedy\] \-@year:\[2015 2024\] + +**Operator Precedence** + +Typical operator precedence rules apply, i.e., Logical negate is the highest priority, followed by Logical and and then Logical Or with the lowest priority. Parenthesis can be used to override the default precedence rules. + +**Examples of Combining Logical Operators** + +Logical operators can be combined to form complex filter expressions. + +The following query will return all books with "comedy" or "horror" genre (AND) published between 2015 and 2024: + +`@genre:[comedy|horror] @year:[2015 2024]` + +The following query will return all books with "comedy" or "horror" genre (OR) published between 2015 and 2024: + +`@genre:[comedy|horror] | @year:[2015 2024]` + +The following query will return all books that either don't have a genre field, or have a genre field not equal to "comedy", that are published between 2015 and 2024: + +`-@genre:[comedy] @year:[2015 2024]` + +### Configuration + +1\. **reader-threads:** (Integer) Controls the amount of threads executing queries. +2\. **writer-threads:** (Integer) Controls the amount of threads processing index mutations. +3\. **use-coordinator:** (boolean) Cluster mode enabler. + +### Dependencies + +1. [Abseil](https://github.com/abseil/abseil-cpp): A collection of C++ libraries designed to augment the standard library. +2. [gRPC](https://github.com/grpc/grpc): A high-performance, open-source universal RPC framework. + + +Additionally, the codebase integrates a forked version of hnswlib. + +### Testing + +1. **Unittesting:** code coverage currently stands at 90%. +2. **Stability test framework:** fuzzy testing based on python. + +### Observability + +The following metrics are added to the INFO command. + +- **search\_total\_indexed\_hash\_keys** (Integer) Total count of HASH keys for all indexes +- **search\_number\_of\_indexes** (Integer) Index schema total count +- **search\_number\_of\_attributes** (Integer) Total count of attributes for all indexes +- **search\_failure\_requests\_count** (Integer) A count of all failed requests, including syntax errors. +- **search\_successful\_requests\_count** (Integer) A count of all successful requests +- **search\_used\_memory\_human** (Integer) A human-friendly readable version of the search\_used\_memory\_bytes metric +- **search\_used\_memory\_bytes** (Integer) The total bytes of memory that all indexes occupy +- **search\_background\_indexing\_status** (String) The status of the indexing process. NO\_ACTIVITY indicates idle indexing. +- **search\_hnsw\_create\_exceptions\_count** (Integer) Count of HNSW creation exceptions. +- **search\_hnsw\_search\_exceptions\_count** (Integer) Count of HNSW search exceptions +- **search\_hnsw\_remove\_exceptions\_count** (Integer) Count of HNSW removal exceptions. +- **search\_hnsw\_add\_exceptions\_count** (Integer) Count of HNSW addition exceptions. +- **search\_hnsw\_modify\_exceptions\_count** (Integer) Count of HNSW modification exceptions +- **search\_modify\_subscription\_skipped\_count** (Integer) Count of skipped subscription modifications +- **search\_remove\_subscription\_successful\_count** (Integer) Count of successful subscription removals +- **search\_remove\_subscription\_skipped\_count** (Integer) Count of skipped subscription removals +- **search\_remove\_subscription\_failure\_count** (Integer) Count of failed subscription removals +- **search\_add\_subscription\_successful\_count** (Integer) Count of successfully added subscriptions +- **search\_add\_subscription\_failure\_count** (Integer) Count of failures of adding subscriptions +- **search\_add\_subscription\_skipped\_count** (Integer) Count of skipped subscription adding processes +- **search\_modify\_subscription\_failure\_count** (Integer) Count of failed subscription modifications +- **search\_modify\_subscription\_successful\_count** (Integer) Count of successful subscription modifications + +### Key Functionality Gaps Relative to RediSearch + +ValkeySearch's current focus is primarily on vector search, leveraging a robust foundation designed for extensibility. It is highly optimized for performance, delivering nearly linear scaling with an increasing number of CPU cores. Furthermore, ValkeySearch employs advanced techniques such as deduplication and string interning to maintain a low memory footprint. However, RediSearch, developed over several years, offers a more comprehensive feature set. The following outlines the key gaps: + +**Unsupported Index Types** +- Full-Text: Allows fast querying of textual data, supporting features such as exact phrase matching, prefix matching, fuzzy matching and more. +- Geospatial: Allows storing, querying, and manipulating location-based data such as radius queries, nearest neighbor searches, and distance calculations, leveraging sorted sets and geohashes. + +**Functionality Gaps** +- Non-vector queries: Currently, non-vector indexes are utilized only within the context of vector search queries. Expanding support to standalone non-vector use cases broadens applicability. For instance, a query could search for products priced within a specific range, tagged as "sport," and with a title containing the word "basketball." An example syntax for such a query might look like: + +``` +FT.SEARCH index_name "@price:[min max] @tags:{sport} @title:basketball" +``` + +- ACLs: allowing query and index operations to respect the permissions configured for each user. + +**Unsupported Commands** + +- FT.AGGREGATE: Allows running aggregation queries on an index, returning grouped and processed results. +- FT.ALIAS[ADD, UPDATE, DELETE]: Allows managing aliases for existing indexes. +- FT.PROFILE: Enables running a query while collecting and displaying execution details. +- FT.EXPLAIN: Enables viewing the query execution plan for debugging purposes. +- FT.EXPLAINCLI: Allows CLI-friendly query explanation output. +- FT.ALTER: Enables modification of an existing index by adding new fields. +- FT.SYNUPDATE, FT.SYNDUMP, FT.TAGVALS: Tag and synonym management. +- FT.CURSOR: Allows management of aggregation cursors (e.g., read, delete). +- FT.CONFIG: Enables configuration of server-wide or index-specific parameters. +- FT.SUG[ADD, GET, DEL, LEN]: Suggestion management + + + +**Unsupported Knobs & Controls** + +The current implementation covers only a subset of the controls for FT.SEARCH and FT.CREATE. A complete list of supported commands and features can be found here: + +**FT.SEARCH** + +- SORTBY: Allows sorting the results based on a specific sortable field in ascending or descending order. +- WITHSCORES: Enables returning the score of each result alongside the document, which indicates its relevance. +- WITHPAYLOADS: Allows retrieving the payloads associated with documents, if they exist. +- WITHSORTKEYS: Enables returning the sort keys used for ordering results when the SORTBY option is used. +- HIGHLIGHT: Allows marking matching text fragments in the results for easier identification using a custom tag. +- SUMMARIZE: Enables returning a summarized version of document fields, specifying the number of fragments and length. +- INKEYS: Allows restricting the search to a specific set of document keys. +- INFIELDS: Enables limiting the search to specified fields in the index schema. +- LANGUAGE: Enables specifying the language for stemming and query parsing. +- EXPANDER: Allows using a custom query expander to modify the query before execution. +- SCORER: Enables specifying a custom scoring function to influence the order of the results. +- VERBATIM: Allows disabling query expansion and stemming, ensuring an exact match for the terms. +- NOSTEM: Enables disabling stemming for individual terms in the query, preserving their original form. +- INORDER: Allows requiring that query terms appear in the same order as specified in the query. +- SLOP: Enables setting the maximum number of intervening terms allowed between query terms for phrase matching. +- EXPLAINSCORE: Allows returning an explanation of the score for each result to understand its relevance. + +**FT.CREATE** + +- LANGUAGE: Enables setting the default language for query parsing and stemming when none is specified. +- LANGUAGE_FIELD: Allows designating a field in the document to specify the language for stemming. +- SCORE: Enables setting a default score for documents in the index to prioritize relevance. +- SCORE_FIELD: Allows specifying a document field to provide a custom score dynamically. +- PAYLOAD_FIELD: Enables specifying a field in the document to store arbitrary payload data for each document. +- MAXTEXTFIELDS: Allows enabling support for more than 32 text fields in the schema. +- TEMPORARY: Enables creating a temporary index that expires after a specified number of seconds. +- NOOFFSETS: Allows disabling offset data for text fields, reducing memory usage at the cost of missing term highlighting and exact snippet matching. +- NOHL: Enables disabling support for highlighting, which saves memory but removes term highlighting capabilities. +- NOFIELDS: Allows disabling the return of document fields, reducing memory and network usage. + + +## Appendix + +Google Cloud Blogs + +- [https://cloud.google.com/blog/products/databases/memorystore-for-redis-vector-search-and-langchain-integration](https://cloud.google.com/blog/products/databases/memorystore-for-redis-vector-search-and-langchain-integration) +- [https://cloud.google.com/blog/products/databases/memorystore-for-redis-cluster-updates-at-next24](https://cloud.google.com/blog/products/databases/memorystore-for-redis-cluster-updates-at-next24) + +Google Cloud Docs + +- [https://cloud.google.com/memorystore/docs/cluster/about-vector-search](https://cloud.google.com/memorystore/docs/cluster/about-vector-search) + +AWS Blogs + +- [https://aws.amazon.com/blogs/aws/vector-search-for-amazon-memorydb-is-now-generally-available/](https://aws.amazon.com/blogs/aws/vector-search-for-amazon-memorydb-is-now-generally-available/) +- [https://aws.amazon.com/about-aws/whats-new/2023/11/vector-search-amazon-memorydb-redis-preview/](https://aws.amazon.com/about-aws/whats-new/2023/11/vector-search-amazon-memorydb-redis-preview/) + +AWS Docs + +- [https://docs.aws.amazon.com/memorydb/latest/devguide/vector-search.html](https://docs.aws.amazon.com/memorydb/latest/devguide/vector-search.html)