Vector Search Over Your Data
Nomic Atlas enables you to search your dataset semantically with vector search.
Using Vector Search
Open the vector search modal by clicking its selection icon or using the hotkey 'V'.
[Image: Vector Search Tool]
After a successful search, a slider appears that you can adjust. The slider ranges over similarity values (i.e., dot products with the search vector): the larger the value, the more similar the data point is to the search vector. Drag the slider to the left to include data points that are less similar, and to the right to keep only those that are more similar. The displayed percentages indicate the proportion of data points captured between the similarity cutoffs. The Atlas map selection updates automatically based on the slider range.
[Image: Vector Search Results]
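To make the slider semantics concrete, here is a minimal sketch in plain Python. It is not how Atlas implements the feature. The function names (`dot`, `select_by_similarity`) and the toy vectors are hypothetical, and the vectors are assumed to be unit-normalized so that the dot product behaves like a cosine similarity.

```python
from typing import List, Sequence, Tuple


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def select_by_similarity(
    query: Sequence[float],
    points: List[Sequence[float]],
    low: float,
    high: float,
) -> Tuple[List[int], float]:
    """Return the indices whose similarity to `query` lies in [low, high],
    plus the percentage of all points that the range captures."""
    sims = [dot(query, p) for p in points]
    selected = [i for i, s in enumerate(sims) if low <= s <= high]
    percent = 100.0 * len(selected) / len(points) if points else 0.0
    return selected, percent


# Toy 2-D "embeddings" (illustrative values, not real model output).
query = [1.0, 0.0]
points = [[1.0, 0.0], [0.8, 0.6], [0.0, 1.0], [-1.0, 0.0]]

# A high lower cutoff keeps only the most similar points.
strict, strict_pct = select_by_similarity(query, points, low=0.9, high=1.0)

# Lowering the cutoff (dragging the slider left) admits less similar points.
loose, loose_pct = select_by_similarity(query, points, low=0.0, high=1.0)

print(strict, strict_pct)
print(loose, loose_pct)
```

Lowering the lower cutoff widens the selection, which mirrors dragging the slider to the left.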
Combining Vector Search with Other Selections
Vector Search is part of the selection paradigm within Atlas. This means that, out of the box, you can combine your vector search filter with other Atlas tools to compose complex data selections.
Below, we join our vector search results with a search selection and a lasso selection to find a subset of technology stock articles we are interested in.
[Image: Vector Search Filter]
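Conceptually, composing selections can be thought of as set operations over row identifiers. The sketch below assumes the selections are combined by intersection, so that a point must satisfy every filter. That is an assumption about the behavior described above, not a statement of how Atlas works internally. The row ids and the `combine_all` helper are hypothetical.

```python
from typing import Set

# Hypothetical row ids produced by three independent selection tools.
vector_search_hits: Set[int] = {1, 2, 3, 5, 8}
keyword_search_hits: Set[int] = {2, 3, 4, 8, 9}
lasso_hits: Set[int] = {3, 8, 10}


def combine_all(*selections: Set[int]) -> Set[int]:
    """Intersect selections: keep only rows present in every one of them."""
    if not selections:
        return set()
    result = set(selections[0])  # copy so the inputs are not mutated
    for selection in selections[1:]:
        result &= selection
    return result


combined = combine_all(vector_search_hits, keyword_search_hits, lasso_hits)
print(sorted(combined))
```

Each additional selection can only narrow the result under this assumption, which matches the idea of drilling down to a subset of interest.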
Vector Search Over Images
Nomic Text and Vision embedding models provide compatible, aligned embeddings (see details here). This means you can run text-to-image and image-to-text vector searches on your data. For example, you can find cat articles by providing a picture of a cat, or find cat images that match the query "What animals are cute to cuddle with?".
[Image: Vector Search Text-to-Image]
Vector Search API
Developers can leverage the Nomic API to run a vector search over their dataset. More examples and documentation for the endpoint are found in the API reference.
Here is an example of a vector search over this dataset:
curl -L 'https://api-atlas.nomic.ai/v1/query/topk' \
  -H 'Content-Type: application/json' \
  -H "Authorization: Bearer $NOMIC_API_KEY" \
  -d '{
    "projection_id": "f1e499cd-b5c1-4d31-b38a-fff61e1f8b59",
    "k": 3,
    "fields": ["title", "average_rating", "price"],
    "query": "footwear"
  }'
The API endpoint returns a list of the closest items to the query from the dataset:
{
  "data": [
    {
      "title": "pediped Unisex-Child Flex Barbara School Uniform Shoe",
      "average_rating": 4.5,
      "price": 59.95000076293945,
      "_similarity": 0.7833542227745056
    },
    {
      "title": "pediped Unisex-Child Flex Eclipse",
      "average_rating": 3.700000047683716,
      "price": 32.20000076293945,
      "_similarity": 0.7810658812522888
    },
    {
      "title": "Cat Footwear Women's Brode St Shoe",
      "average_rating": 3.799999952316284,
      "price": null,
      "_similarity": 0.7771036028862
    }
  ]
}
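The same request can be made from Python. The following is a minimal sketch using only the standard library. It mirrors the curl example above (same URL, headers, and body) and assumes the response has the shape shown in the sample. The helper names `build_topk_request` and `titles_by_similarity` are hypothetical and are not part of any Nomic client library.

```python
import json
import os
import urllib.request
from typing import Any, Dict, List

TOPK_URL = "https://api-atlas.nomic.ai/v1/query/topk"


def build_topk_request(
    projection_id: str,
    query: str,
    k: int,
    fields: List[str],
    api_key: str,
) -> urllib.request.Request:
    """Build (but do not send) the POST request from the curl example."""
    payload = {
        "projection_id": projection_id,
        "k": k,
        "fields": fields,
        "query": query,
    }
    return urllib.request.Request(
        TOPK_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def titles_by_similarity(response: Dict[str, Any]) -> List[str]:
    """Return titles ordered from most to least similar.

    Assumes each item has "title" and "_similarity" keys, as in the sample.
    """
    rows = sorted(response["data"], key=lambda r: r["_similarity"], reverse=True)
    return [r["title"] for r in rows]


request = build_topk_request(
    projection_id="f1e499cd-b5c1-4d31-b38a-fff61e1f8b59",
    query="footwear",
    k=3,
    fields=["title", "average_rating", "price"],
    api_key=os.environ.get("NOMIC_API_KEY", ""),
)

# Sending the request needs network access and a valid API key:
# with urllib.request.urlopen(request) as resp:
#     result = json.load(resp)
#     print(titles_by_similarity(result))
```

Note that `price` can be `null` in the response (`None` after JSON decoding), so downstream code should handle missing field values.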