Mongo DB
Last updated
Was this helpful?
Last updated
Was this helpful?
This document provides clear steps on how to use and integrate MongoDB with Superlinked.
To integrate MongoDB with Superlinked, ensure you are using a version that supports Atlas Vector Search capabilities. Refer to the MongoDB documentation for .
Superlinked requires access to MongoDB to list, create, and delete Atlas Search Indexes. As of writing, MongoDB separates functionality by database instance sizes. If you use anything below M10, the database does not support creating, listing, and deleting the Atlas Search Index via a standard user, only via the administration API. You can read more and also . To support all types, Superlinked uses the aforementioned API to manage the indexes.
Due to the reasons above, an API key with the Project Data Access Admin
role is required. More about how to create that can be found .
Note: When using that API, you will need
project_id
andcluster_name
, how to find this information is also described .
To integrate MongoDB, you need to add the MongoDBVectorDatabase
class and include it in the executor. Here’s how you can do it:
Project ID: to find your Project ID, select you organization in the top left corner of Atlas UI. Afterward, find your project (don't click on it). In the last column ("Actions") expend the menu by clicking on the ellipses (...), then select "Copy Project ID" which will paste it to your clipboard.
Alternatively, click on your project on Atlas and in the URL you will find the id: https://cloud.mongodb.com/v2/12755aca606daa697d3e30b9#/overview
where the 12755aca606daa697d3e30b9
before the #
and after the https://cloud.mongodb.com/v2/
is your project ID. The organization ID is very similar to this string, but please make sure that you copy the ID after you selected the project!
Once you have configured the MongoDBVectorDatabase
, set it as your vector_database
in the RestExecutor
:
A step-by-step guide to set up a database, a user, and the required API key.
Create your cluster. The cluster name will be needed for the configuration mentioned above. You can choose any other options as they do not impact Superlinked's functionality.
Click on the Database
option in the left menu column.
Once the cluster is created, click on its name and then go to the collections tab or click on the Browse Collections
button.
Click on Add My Own Data
and provide a name for your database and collection. The database name will be required for the configuration above. The collection name is not critical and can be deleted later as Superlinked will create its own.
Click on the Database
option on the left.
Click the Connect
button next to your cluster's name.
In the pop-up window:
Click on the Allow access from Anywhere
or select the Add a different IP address
and insert your VM's or local IP address.
Enter the username and password for your user. These credentials will be needed for the configuration above.
Click on the Access Manager
selector at the top left corner next to your organization selector and select your project.
Go to the API Keys
tab.
Provide a name for the API key and select the Project Data Access Admin
role in the Project Permissions
selector.
Copy the Private Key
as it will not be accessible again. The Public key
and Private key
will be your admin_api_user
and admin_api_password
in your connection in this order.
Extra parameters: extra params can be passed in to the PyMongo client called MongoClient. Please read the for more information.
Navigate to and sign in.
You can find an example that utilizes Mongo DB .