Name	Name	Last commit message	Last commit date
parent directory ..
src	src
test_data	test_data
.env.example	.env.example
.gitignore	.gitignore
README.md	README.md
RushDB-RAG-API.postman_collection.json	RushDB-RAG-API.postman_collection.json
import_data.py	import_data.py
pyproject.toml	pyproject.toml
run_app.py	run_app.py
uv.lock	uv.lock

RushDB Generic RAG API

A generic RAG (Retrieval Augmented Generation) API using RushDB for record vectorization and vector search capabilities.

Features

Generic Record Processing: Index any text field from any record type in RushDB
Vector Embeddings: Use sentence transformers to create embeddings for semantic search
RushDB Integration: Add embedding properties directly to existing records
Vector Search: Search for relevant records using cosine similarity
FastAPI Interface: RESTful API for easy integration
Auto-Configuration: Automatic initialization from environment variables

Setup with UV

This project uses UV for dependency management. Make sure you have UV installed:

# Install UV if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone the repository and navigate to the project
cd python-books-rag

# Install dependencies
uv sync

Configuration

Copy the example environment file:

cp .env.example .env

Edit .env and add your RushDB API token:

# Get your API token from https://app.rushdb.com/
RUSHDB_API_TOKEN=your_actual_token_here

(Optional) Customize other settings in .env:

EMBEDDING_MODEL=all-MiniLM-L6-v2

Quick Start

Run the application:

uv run python run_app.py

Or start the API server directly:

uv run uvicorn src.api:app --host 0.0.0.0 --port 8000 --reload

The application will automatically initialize from your .env configuration. The API will be available at http://localhost:8000 with interactive docs at http://localhost:8000/docs.

Install Dependencies

# Navigate to the project directory
cd /path/to/project

# Install dependencies with UV
uv sync

Configuration

You'll need a RushDB API token. You can get one from:

RushDB Cloud Dashboard (for cloud instance)
Your self-hosted RushDB instance

Usage

The application provides a RESTful API for record indexing and search. All configuration is handled through environment variables - no manual initialization required.

API Endpoints

Check API status and configuration:

curl http://localhost:8000/

Health check:

curl http://localhost:8000/health

Index records:

curl -X POST "http://localhost:8000/index" \
  -H "Content-Type: application/json" \
  -d '{
    "labels": ["Article"],
    "field": "content",
    "vector_dimension": 384
  }'

You can also use more complex search queries for indexing:

curl -X POST "http://localhost:8000/index" \
  -H "Content-Type: application/json" \
  -d '{
    "labels": ["Article"],
    "where": {"category": "technology"},
    "field": "content",
    "vector_dimension": 384,
    "limit": 500
  }'

Search records (basic search):

curl -X POST "http://localhost:8000/search" \
  -H "Content-Type: application/json" \
  -d '{
    "labels": ["Article"],
    "query": "What is RushDB?",
    "limit": 5
  }'

Advanced search with filtering:

curl -X POST "http://localhost:8000/search" \
  -H "Content-Type: application/json" \
  -d '{
    "labels": ["Article"],
    "query": "What is RushDB?",
    "limit": 5,
    "vector_dimension": 384,
    "min_score": 0.7,
    "offset": 0
  }'

All endpoints return JSON responses. The API automatically initializes from your .env configuration on startup.

How It Works

Data Structure in RushDB

The application adds embedding properties directly to existing records:

Record (e.g., Article)
{
  "title": "Sample Article",
  "content": "This is the article content...",
  "embedding": [0.1, 0.2, 0.3, ...],
  // ... other properties
}

Processing Flow

Record Selection: Records are retrieved from RushDB using the provided search query
Content Extraction: Text from the specified field is extracted
Vectorization: The content is converted to a vector embedding using sentence transformers
Storage: The embedding is added as a property to the existing record
Search: Vector similarity search is performed directly on the records

Vector Search

The application uses RushDB's powerful vector search capabilities with the following features:

Label-based filtering: Target specific record types
Vector similarity: Calculate cosine similarity between query and stored embeddings
Minimum score threshold: Filter out low-relevance results (optional)
Sorting: Order results by similarity score
Pagination: Control the number of results returned

Search parameters:

labels: Labels of records to search
query: Text query to find similar content
limit: Maximum number of results to return
min_score: Minimum similarity threshold (0-1)
offset: Number of results to skip (for pagination)
vector_dimension: Control embedding size/quality tradeoff

# Basic vector search query
results = db.records.find({
    "labels": ["Article"],
    "aggregate": {
        "score": {
            "alias": "$record",
            "field": "embedding",
            "fn": "gds.similarity.cosine",
            "query": query_vector
        }
    },
    "orderBy": { "score": "desc" },
    "limit": limit
})

Development

Code Structure

src/rag_engine.py: Core RAG implementation with text processing and RushDB operations
src/api.py: FastAPI application with REST endpoints
src/config.py: Configuration management and environment variable handling
run_app.py: Application runner with testing and server startup
pyproject.toml: Project configuration and dependencies

Key Components

TextProcessor: Handles text vectorization
RagService: Manages RushDB operations for indexing and search
FastAPI App: RESTful API with automatic configuration from environment

Customization

Embedding Model: Change the EMBEDDING_MODEL in .env to use different sentence transformer models
Vector Dimensions: Use the vector_dimension parameter in API requests to specify the embedding dimension:
- 384: Uses the all-MiniLM-L6-v2 model (faster, smaller embeddings)
- 768: Uses the all-mpnet-base-v2 model (slower, more accurate embeddings)
Search Configuration: Modify similarity scoring in the search aggregation
Record Selection: Specify different record labels and fields to process

Dependencies

fastapi: Web framework for the API
rushdb: RushDB Python SDK
sentence-transformers: For text embeddings
uvicorn: ASGI server
pydantic: Data validation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

RushDB Generic RAG API

Features

Setup with UV

Configuration

Quick Start

Install Dependencies

Configuration

Usage

API Endpoints

How It Works

Data Structure in RushDB

Processing Flow

Vector Search

Development

Code Structure

Key Components

Customization

Dependencies

FilesExpand file tree

python-books-rag

Directory actions

More options

Directory actions

More options

Latest commit

History

python-books-rag

Folders and files

parent directory

README.md

RushDB Generic RAG API

Features

Setup with UV

Configuration

Quick Start

Install Dependencies

Configuration

Usage

API Endpoints

How It Works

Data Structure in RushDB

Processing Flow

Vector Search

Development

Code Structure

Key Components

Customization

Dependencies