CAGRA configuration

更新时间:
复制 MD 格式

This page describes the index build parameters and index search parameters for the CAGRA algorithm.

Index build parameters

ParameterTypeDefault valueDescription
proxima.cagra.index.graph_degreeint64Number of edges per node in the nearest neighbor graph during queries. Higher values improve recall but reduce throughput.
proxima.cagra.index.intermediate_graph_degreeint128Number of edges per node in the nearest neighbor graph during the graph creation phase. The graph is pruned down to graph_degree at query time. Must be greater than or equal to graph_degree and no more than 1.5x graph_degree.
proxima.cagra.index.mode.fp16int0Specifies whether to enable FP16 quantization. Set to 1 to enable. Enabling FP16 mode is recommended to avoid degrading recall.

Index search parameters

ParameterTypeDefault valueDescription
proxima.cagra.search.itopk_sizeint64Size of the internal candidate set during each iteration. This is the primary parameter for tuning the recall-throughput trade-off. Must be a multiple of 32 and greater than or equal to your target top-k value. For example, set to at least 100 when retrieving top-100 results. Higher values improve recall but reduce throughput.
proxima.cagra.search.use_batchable_searcherint0Specifies whether to enable batch processing. Set to 1 to enable batch processing. This way, the system can process query requests that arrive within a short time window in a batch, which significantly improves throughput.
proxima.cagra.search.use_gpu_built_indexint0Specifies the index source. Only 0 is supported, which uses the graph built by the CPU.