Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Implementation] Remote Vector Index Builder - Create Vectors Dataset and Upload Index Tasks #12

Open
2 tasks done
rchitale7 opened this issue Feb 25, 2025 · 0 comments · May be fixed by #17
Open
2 tasks done
Assignees
Labels
enhancement New feature or request Roadmap:Vector Database/GenAI Project-wide roadmap label untriaged

Comments

@rchitale7
Copy link
Member

rchitale7 commented Feb 25, 2025

Description

This issue tracks the implementation of the create_vectors_dataset and upload_index tasks - these tasks handle downloading vectors from a Remote Store repository, and uploading the knn index to the Remote Store repository, respectively. The build_index task implementation will be tracked in a separate issue. All 3 of these tasks are utilized by the Remote Vector Index Builder Service to construct a knn index on hardware accelerators (such as GPUs), for an Opensearch cluster. For more background on the Remote Vector Index Builder Service, please read: opensearch-project/k-NN#2293

The tasks are part of the core Remote Vector Index Builder Python library.In the initial implementation, only the s3 Remote Store repository will be supported. However, the tasks must be designed to support other repositories (such as Azure Blob Storage, GCP) in the future.

Sub-tasks

  • Create a draft PR implementation, get feedback from others
  • Integrate feedback, add unit tests, and do benchmarking - finalize implementation
@rchitale7 rchitale7 added enhancement New feature or request Roadmap:Vector Database/GenAI Project-wide roadmap label labels Feb 25, 2025
@rchitale7 rchitale7 self-assigned this Feb 25, 2025
@rchitale7 rchitale7 changed the title [Implementation] Remote Vector Index Builder - Object Store Client [Implementation] Remote Vector Index Builder - Create Vectors Dataset and Upload Index Tasks Feb 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Roadmap:Vector Database/GenAI Project-wide roadmap label untriaged
Projects
Status: New
Development

Successfully merging a pull request may close this issue.

1 participant