Coins League

Optimizing IVF-PQ Performance with RAPIDS cuVS: Key Tuning Techniques

July 18, 2024




Tony Kim
Jul 18, 2024 19:39

Learn how to optimize the IVF-PQ algorithm for vector search performance using RAPIDS cuVS, with practical tips on tuning hyperparameters and improving recall.





The first part of this series gave an overview of the IVF-PQ algorithm, explaining how it builds on the IVF-Flat algorithm and uses Product Quantization (PQ) to compress the index and support larger datasets. Part two turns to the practical aspects of tuning IVF-PQ performance, which is crucial for achieving optimal results, especially when dealing with billion-scale datasets.

Tuning Parameters for Index Building

IVF-PQ shares some parameters with IVF-Flat, such as the coarse-level indexing and search hyperparameters. However, IVF-PQ introduces additional parameters that control compression. One of the key parameters is n_lists, which determines the number of partitions (inverted lists) into which the input dataset is clustered. Search performance is influenced by the number of lists probed and their sizes. Experiments suggest that n_lists in the range of 10K to 50K yields good performance across recall levels, though this can vary depending on the dataset.
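To make the relationship between n_lists, the number of probed lists, and list size concrete, here is a minimal sketch in plain Python. It assumes the lists are perfectly balanced, which real clusterings are not, and the helper name probe_workload is hypothetical rather than part of cuVS.

```python
def probe_workload(n_rows, n_lists, n_probes):
    """Estimate per-query scan work, assuming all inverted lists are equally sized."""
    if n_lists <= 0 or not 0 < n_probes <= n_lists:
        raise ValueError("need n_lists > 0 and 0 < n_probes <= n_lists")
    avg_list_size = n_rows / n_lists       # vectors per list (idealized)
    scanned = avg_list_size * n_probes     # vectors compared per query
    return avg_list_size, scanned, scanned / n_rows


# Compare a few n_lists values in the 10K to 50K range for a 100M-row dataset.
for n_lists in (10_000, 20_000, 50_000):
    avg, scanned, fraction = probe_workload(100_000_000, n_lists, n_probes=50)
    print(f"n_lists={n_lists}: avg list size={avg:.0f}, "
          f"scanned per query={scanned:.0f} ({fraction:.3%} of the dataset)")
```

With the number of probes fixed, more lists means fewer vectors scanned per query, but a larger coarse-level search and usually lower recall, which is why the right value depends on the dataset.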

Another important parameter is pq_dim, which controls compression. A good technique for tuning it is to start with one fourth of the number of features in the dataset and increase it in steps. Figure 2 in the original blog post illustrates significant drops in QPS, which can be attributed to factors such as increased compute work and shared memory requirements per CUDA block.
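The "start at one fourth and increase in steps" approach can be sketched as a small candidate generator. The rounding to multiples of 8 is an assumption made here for illustration, and pq_dim_candidates is a hypothetical helper, not a cuVS function.

```python
def pq_dim_candidates(n_features, step=8):
    """List pq_dim values to try, from about n_features / 4 up to n_features.

    Rounding to multiples of `step` is an illustrative assumption.
    """
    start = max(step, (n_features // 4) // step * step)
    return list(range(start, n_features + 1, step))


# Candidates for a 96-dimensional dataset: start near 96 / 4 = 24.
print(pq_dim_candidates(96))
```

Each candidate would then be benchmarked for recall and QPS, stopping once recall is sufficient, since larger values reduce compression and throughput.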

The pq_bits parameter, ranging from 4 to 8, controls the number of bits used in each individual PQ code, affecting the codebook size and recall. Reducing pq_bits can improve search speed by fitting the look-up table (LUT) in shared memory, though this comes at the cost of recall.
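As a rough illustration of how pq_dim and pq_bits together determine compression, the following sketch computes the encoded size of one vector and the resulting compression ratio. It counts only the PQ codes, ignores any per-list or per-vector overhead in a real index, and assumes 4-byte (float32) input features; the function names are hypothetical.

```python
def codebook_entries(pq_bits):
    """Number of entries in one PQ codebook: 2 ** pq_bits."""
    if not 4 <= pq_bits <= 8:
        raise ValueError("pq_bits is expected to be between 4 and 8")
    return 2 ** pq_bits


def pq_code_bytes(pq_dim, pq_bits):
    """Bytes needed to store the PQ codes of one vector, rounded up."""
    codebook_entries(pq_bits)  # validates the pq_bits range
    return (pq_dim * pq_bits + 7) // 8


def compression_ratio(n_features, pq_dim, pq_bits, input_bytes_per_feature=4):
    """Raw vector size divided by encoded size (codes only)."""
    return n_features * input_bytes_per_feature / pq_code_bytes(pq_dim, pq_bits)


# A 128-dimensional float32 vector with pq_dim = 32 at 8 and 4 bits per code.
for bits in (8, 4):
    print(f"pq_bits={bits}: {pq_code_bytes(32, bits)} bytes per vector, "
          f"{compression_ratio(128, 32, bits):.0f}x smaller, "
          f"{codebook_entries(bits)} codebook entries")
```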

Additional Parameters

The codebook_kind parameter determines how the codebooks for the second-level quantizer are constructed: either for each subspace or for each cluster. The choice between these options can affect training time, GPU shared memory utilization, and recall. Parameters such as kmeans_n_iters and kmeans_trainset_fraction are also important, though they rarely need adjustment.
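One way to see why the two codebook options differ in training cost and memory is to count the values each would hold. The sketch below assumes one codebook per subspace (pq_dim codebooks) or one per cluster (n_lists codebooks), each with 2 ** pq_bits entries of length ceil(n_features / pq_dim). That layout, the function name, and the "subspace" / "cluster" labels are illustrative assumptions, not a statement of how cuVS stores its codebooks.

```python
import math


def codebook_values(kind, n_features, pq_dim, pq_bits, n_lists):
    """Count the values held across all codebooks for a given kind.

    Assumed layout: n_codebooks x (2 ** pq_bits) entries x pq_len values.
    """
    pq_len = math.ceil(n_features / pq_dim)  # length of each sub-vector
    if kind == "subspace":
        n_codebooks = pq_dim
    elif kind == "cluster":
        n_codebooks = n_lists
    else:
        raise ValueError(f"unknown codebook kind: {kind!r}")
    return n_codebooks * (2 ** pq_bits) * pq_len


for kind in ("subspace", "cluster"):
    total = codebook_values(kind, n_features=128, pq_dim=32, pq_bits=8, n_lists=10_000)
    print(f"{kind}: {total:,} codebook values")
```

Under these assumptions the per-cluster option trains and stores far more codebook data when n_lists is much larger than pq_dim, which is consistent with its effect on training time.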

Tuning Parameters for Search

The n_probes parameter, discussed in the previous blog post on IVF-Flat, is essential for search accuracy and throughput. IVF-PQ provides additional parameters such as internal_distance_dtype and lut_dtype, which control the representation of the distance or similarity during the search and the datatype used to store the LUT, respectively. Adjusting these parameters can significantly affect performance, especially for datasets with large dimensionality.
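The effect of lut_dtype on whether the LUT fits in shared memory can be estimated with simple arithmetic. This sketch assumes the LUT holds pq_dim * 2 ** pq_bits entries of the chosen element size; the 48 KiB budget is an illustrative placeholder, not a figure from the article, and the real limit depends on the GPU and kernel configuration.

```python
SHARED_MEMORY_BUDGET = 48 * 1024  # bytes; illustrative placeholder only


def lut_bytes(pq_dim, pq_bits, dtype_bytes):
    """Approximate LUT size: one entry per (subspace, codebook entry) pair."""
    return pq_dim * (2 ** pq_bits) * dtype_bytes


def fits_in_shared_memory(pq_dim, pq_bits, dtype_bytes, budget=SHARED_MEMORY_BUDGET):
    """True if the estimated LUT fits in the assumed shared memory budget."""
    return lut_bytes(pq_dim, pq_bits, dtype_bytes) <= budget


# Compare 32-bit, 16-bit and 8-bit LUT entries for pq_dim = 64, pq_bits = 8.
for label, size in (("32-bit", 4), ("16-bit", 2), ("8-bit", 1)):
    total = lut_bytes(64, 8, size)
    print(f"{label} LUT: {total // 1024} KiB, fits: {fits_in_shared_memory(64, 8, size)}")
```

In this example a 32-bit LUT exceeds the assumed budget while the narrower types fit, which shows why a lower-precision lut_dtype can raise throughput at some cost in accuracy.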

Improving Recall with Refinement

When tuning parameters is not enough to achieve the desired recall, refinement offers a promising alternative. This separate operation, performed after the ANN search, recomputes exact distances for the selected candidates and reranks them. Refinement can significantly improve recall, as demonstrated in Figure 4 of the original blog post, though it requires access to the source dataset.
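The refinement step can be sketched in a few lines of plain Python. This is a CPU illustration of the idea, assuming squared L2 distance and a candidate list that is larger than k; it is not the cuVS implementation, and refine_candidates is a hypothetical name.

```python
import heapq


def squared_l2(a, b):
    """Exact squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def refine_candidates(dataset, query, candidate_ids, k):
    """Re-rank ANN candidates by exact distance and keep the best k ids."""
    scored = ((squared_l2(dataset[i], query), i) for i in candidate_ids)
    return [i for _, i in heapq.nsmallest(k, scored)]


# Toy data: the ANN search returned four candidates in an imperfect order.
dataset = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [3.0, 3.0], [0.1, 0.1]]
query = [0.0, 0.0]
ann_candidates = [3, 1, 4, 0]

print(refine_candidates(dataset, query, ann_candidates, k=2))  # [0, 4]
```

Because the exact distances are computed from the original vectors, the uncompressed dataset must remain accessible, which matches the requirement noted above.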

Summary

The series on accelerating vector search with inverted-file indexes covers two cuVS algorithms: IVF-Flat and IVF-PQ. IVF-PQ extends IVF-Flat with PQ compression, enabling faster searches and the ability to handle billion-scale datasets with limited GPU memory. By fine-tuning parameters for index building and search, data practitioners can achieve the best results efficiently. The RAPIDS cuVS library offers a range of vector search algorithms to cater to various use cases, from exact searches to low-accuracy, high-QPS ANN methods.

For practical tuning of IVF-PQ parameters, refer to the IVF-PQ notebook on GitHub. For more details on the provided APIs, see the cuVS documentation.

Image source: Shutterstock


