feat: add support for DataStax HCD vector store #3728

Merged
merged 1 commit into from
Sep 9, 2024


bradfordcp
Contributor

Added support for DataStax Hyper-Converged Database (HCD) backed vector store.

@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. enhancement New feature or request labels Sep 9, 2024
@github-actions github-actions bot added enhancement New feature or request and removed enhancement New feature or request labels Sep 9, 2024

This pull request is automatically being deployed by Amplify Hosting (learn more).

Access this pull request here: https://pr-3728.dmtpw4p5recq1.amplifyapp.com

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label Sep 9, 2024
@italojohnny italojohnny self-requested a review September 9, 2024 12:56
class HCDVectorStoreComponent(LCVectorStoreComponent):
    display_name: str = "Hyper-Converged Database"
    description: str = "Implementation of Vector Store using Hyper-Converged Database (HCD) with search capabilities"
    documentation: str = "https://python.langchain.com/docs/integrations/vectorstores/astradb"
Collaborator


Is there any specific HCD documentation we want to link to?

Contributor Author


Good point, we can link to our install docs. It's worth noting that we are using the AstraDB LangChain integration, so that URL is valid even though the component talks to HCD.



class HCDVectorStoreComponent(LCVectorStoreComponent):
    display_name: str = "Hyper-Converged Database"
Collaborator


Can you clarify how we'll use HCD re: naming? Do we want it to be so strongly associated with DataStax that we don't need to specify "DataStax HCD"?

Contributor Author


I asked about this internally and the decision was to keep it as Hyper-Converged Database. Thanks for checking!

name="api_endpoint",
display_name="HCD API Endpoint",
info="API endpoint URL for the HCD service.",
value="HCD_API_ENDPOINT",
Collaborator


Does langchain-astradb support interpreting this env variable as the endpoint (instead of ASTRA_DB_API_ENDPOINT)? I think this naming makes sense either way, though, and it should be supported in langchain-astradb if it isn't already.

Contributor Author


Unfortunately no, langchain-astradb only looks for ASTRA_DB_API_ENDPOINT. There is no associated HCD_API_ENDPOINT; instead, we set the environment parameter during client initialization: environment=Environment.HCD.
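
For anyone following along, here is a rough sketch of what that could look like. It is not the code from this PR: `build_hcd_store_kwargs` is a hypothetical helper, the fallback string `"hcd"` is an assumption about the value of `Environment.HCD`, and reading `HCD_API_ENDPOINT` from the process environment is done by the helper itself because langchain-astradb does not look for that variable.

```python
import os

try:
    # astrapy provides the Environment constants used by langchain-astradb.
    from astrapy.constants import Environment
    HCD_ENVIRONMENT = Environment.HCD
except ImportError:
    # Assumption: fallback value so the sketch runs without astrapy installed.
    HCD_ENVIRONMENT = "hcd"


def build_hcd_store_kwargs(collection_name, api_endpoint=None, env=None):
    """Hypothetical helper: assemble keyword arguments for a vector store aimed at HCD."""
    env = os.environ if env is None else env
    # langchain-astradb only reads ASTRA_DB_API_ENDPOINT, so resolve
    # HCD_API_ENDPOINT here and pass the endpoint explicitly.
    endpoint = api_endpoint or env.get("HCD_API_ENDPOINT")
    if not endpoint:
        raise ValueError("Set api_endpoint or the HCD_API_ENDPOINT environment variable.")
    return {
        "collection_name": collection_name,
        "api_endpoint": endpoint,
        # This is what tells the client it is talking to HCD rather than Astra.
        "environment": HCD_ENVIRONMENT,
    }
```

The resulting dict would presumably be unpacked into the vector store constructor along with an embedding and credentials; those arguments are left out here because they depend on how the HCD deployment is configured.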

),
HandleInput(
name="embedding",
display_name="Embedding or Astra Vectorize",
Collaborator


Is vectorize supported in HCD?

Contributor Author


I believe the code paths are present, but I am unclear on how it is configured at this time.

@italojohnny italojohnny merged commit ce11df2 into langflow-ai:main Sep 9, 2024
35 of 37 checks passed
Labels
enhancement New feature or request lgtm This PR has been approved by a maintainer size:L This PR changes 100-499 lines, ignoring generated files.
4 participants