Superlinked: The Next Evolution in Retrieval & RAG Systems
Have you ever wondered why your search results or RAG (retrieval-augmented generation) pipeline sometimes feels... flat? That's because traditional vector search often focuses only on unstructured text embeddings, ignoring the structured metadata (like recency, categories, ratings, numbers, or images) that could massively improve relevance. That's where Superlinked comes in.
It's an open-source framework that lets you combine structured + unstructured data into embeddings, and dynamically tune retrieval at query time.
Let's break it all down.
High-Level Idea
Superlinked is all about spaces:
Each field of your data (text, number, category, timestamp, image) is mapped into its own embedding space.
At query time, you compose these spaces together with weights and parameters to get a final, ranked result.
This means you can run multi-signal retrieval that is context-aware, tunable, and doesn't require re-indexing.
The repo puts it simply: "Improve your vector search relevance by encoding metadata together with your unstructured data into vectors."
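To make that idea concrete, here is a minimal, library-free sketch of multi-space fusion: each field contributes its own score, and a weighted sum produces the final ranking. The vectors, ratings, and weights below are invented purely for illustration, not taken from Superlinked.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Two toy documents: an unstructured text embedding plus a structured rating field.
docs = [
    {"id": "a", "text_vec": [0.9, 0.1], "rating": 4.8},
    {"id": "b", "text_vec": [0.8, 0.3], "rating": 2.1},
]

def score(doc, query_vec, weights):
    # Each field acts as its own "space"; the final score is a weighted sum.
    text_score = cosine(doc["text_vec"], query_vec)
    rating_score = doc["rating"] / 5.0  # normalize the rating into [0, 1]
    return weights["text"] * text_score + weights["rating"] * rating_score

query_vec = [1.0, 0.0]
ranked = sorted(docs,
                key=lambda d: score(d, query_vec, {"text": 1.0, "rating": 0.5}),
                reverse=True)
```

Turning the `rating` weight up or down re-ranks the same documents without touching the embeddings, which is the core trick the framework generalizes.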
Core Building Blocks
From the repo's examples, here are the main components you'll work with:
- Schema: defines your data fields (e.g., id, text, rating, timestamp, category).
- Spaces: encoders for each field:
  - TextSimilaritySpace for text embeddings (sentence-transformers, etc.)
  - NumberSpace for numeric values (ratings, prices, priorities)
  - CategoricalSpace for discrete categories
  - RecencySpace / EventEffectSpace for time-based signals
  - ImageSimilaritySpace for visual embeddings
- Index: groups one or more spaces and decides which fields to store.
- Source & Executor: handle ingestion (InMemorySource and InMemoryExecutor in the examples).
- Query: declarative search builder with .find(), .similar(), .limit(), .select_all().
- Param: query-time knobs you can adjust: weights, queries, limits.
- Optional LLM parsing: extract query params from natural language with .with_natural_query(...).
How It Works (Step by Step)
1. Define a Schema

```python
import superlinked.framework as sl

class Review(sl.Schema):
    id: sl.IdField
    text: sl.String

review = Review()  # schema instance used by the source and query below
```

This declares the structure of your documents.

2. Create Spaces

```python
space = sl.TextSimilaritySpace(text=review.text, model="all-MiniLM-L6-v2")
```

Each field gets its own encoder.

3. Create an Index

```python
index = sl.Index(space)
```

This binds your space(s) together.

4. Ingest Data

```python
source = sl.InMemorySource(review)
app = sl.InMemoryExecutor(sources=[source], indices=[index]).run()
source.put([{"id": "1", "text": "Amazing acting"},
            {"id": "2", "text": "Boring plot"}])
```

Documents are embedded and stored.

5. Build & Run a Query

```python
query = sl.Query(index).find(review).similar(space, sl.Param("search")).select_all()
result = app.query(query, search="excellent performance")
```

Parameters (sl.Param) let you tune search at runtime.
Retrieval = fusion of embeddings across multiple spaces, combined with query-time weights.
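A toy illustration of why this needs no re-indexing: the per-space similarity scores are computed once when documents are embedded, and only the fusion weights change between queries. The scores and weights below are made up for the example.

```python
# Per-space similarity scores for one query, computed once at embedding time.
candidates = {
    "doc1": {"text": 0.92, "recency": 0.20},
    "doc2": {"text": 0.75, "recency": 0.95},
}

def fuse(scores, weights):
    # Weighted sum across spaces; only the weights change between queries.
    return sum(weights[space] * s for space, s in scores.items())

def rank(weights):
    return sorted(candidates, key=lambda doc: fuse(candidates[doc], weights),
                  reverse=True)

# Same stored vectors, two different retrieval strategies.
relevance_first = rank({"text": 1.0, "recency": 0.1})
recency_first = rank({"text": 0.5, "recency": 1.0})
```

Switching from a relevance-heavy to a recency-heavy weighting flips the ranking without re-embedding or re-indexing anything.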
Example Test Cases (from the repo)
The repo shows how to test everything with pytest. Example:
Text-only retrieval: check that semantic matching works.
Text + rating weighting: bias results towards highly rated products.
Limit & metadata: ensure filters and limits apply correctly.
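A sketch of what such pytest cases might look like, using a toy keyword-overlap retriever rather than the repo's actual Superlinked fixtures; the product data and scoring are invented for illustration.

```python
# Toy in-memory retriever, standing in for a Superlinked app under test.
PRODUCTS = [
    {"id": "p1", "text": "wireless mouse", "rating": 3.0},
    {"id": "p2", "text": "wireless mouse pro", "rating": 4.9},
    {"id": "p3", "text": "desk lamp", "rating": 4.5},
]

def retrieve(query, rating_weight=0.0, limit=10):
    def score(p):
        # Crude "semantic" match: count of shared words, plus optional rating bias.
        text_score = len(set(query.split()) & set(p["text"].split()))
        return text_score + rating_weight * p["rating"] / 5.0
    return sorted(PRODUCTS, key=score, reverse=True)[:limit]

def test_text_only_retrieval():
    assert retrieve("desk lamp")[0]["id"] == "p3"

def test_rating_weight_biases_results():
    # With a strong rating weight, the higher-rated "pro" mouse wins the tie.
    assert retrieve("wireless mouse", rating_weight=5.0)[0]["id"] == "p2"

def test_limit_applies():
    assert len(retrieve("wireless mouse", limit=1)) == 1
```

Running `pytest` on a file like this exercises each retrieval behavior in isolation, which mirrors the structure of the repo's tests.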
(Full runnable code examples are provided in the Superlinked repo.)
Why It Matters for RAG & Embedding Systems
Traditional RAG = one embedding per doc.
Superlinked RAG = multiple embeddings per doc (spaces), fused together with weights.
That unlocks new retrieval powers:
Recency bias for fast-changing data (support, news, medical).
Personalization with user-specific weights.
Multi-modal fusion (text + images + numbers).
Hierarchical retrieval (section → paragraph → sentence).
LLM-driven adaptive retrieval (auto-extracted params).
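Recency bias, the first of these, is typically implemented as a time-decay score fused with the other signals. Here is one common variant, exponential half-life decay; the 7-day half-life is an arbitrary choice for the example, not a Superlinked default.

```python
import time

def recency_score(created_at, now=None, half_life_days=7.0):
    """Exponential time decay: a document exactly half_life_days old scores 0.5,
    a brand-new document scores 1.0."""
    now = time.time() if now is None else now
    age_days = max(0.0, (now - created_at) / 86400.0)
    return 0.5 ** (age_days / half_life_days)

NOW = 1_700_000_000  # fixed "now" so the example is deterministic
fresh = recency_score(NOW - 1 * 86400, now=NOW)    # 1 day old
stale = recency_score(NOW - 30 * 86400, now=NOW)   # 30 days old
```

The resulting 0-to-1 score slots directly into a weighted fusion alongside text similarity, so "prefer newer" becomes just another tunable weight.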
Use Cases in RAG / Advanced Retrieval
1. Context-Aware Retrieval for Support Bots
Schema: ticket body + category + priority + created_at.
Spaces: text, categorical, number, recency.
Prioritize recent, high-priority, same-category tickets.
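A minimal sketch of that prioritization as a weighted score; the field names, 1-to-5 priority scale, and weights are assumptions for illustration.

```python
def ticket_score(ticket, query_category, recency, w_cat=1.0, w_pri=0.5, w_rec=1.0):
    # recency: a precomputed 0..1 time-decay score for the ticket's created_at.
    same_category = 1.0 if ticket["category"] == query_category else 0.0
    priority = ticket["priority"] / 5.0  # assume priorities run 1 (low) to 5 (urgent)
    return w_cat * same_category + w_pri * priority + w_rec * recency

urgent = ticket_score({"category": "billing", "priority": 5}, "billing", recency=0.9)
old_misc = ticket_score({"category": "shipping", "priority": 1}, "billing", recency=0.2)
```

A text-similarity term would normally be added as a fourth signal; it is left out here to keep the structured part of the fusion visible.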
2. Personalized Shopping Assistant
Schema: description + price + rating + category + stock.
Spaces: text + number(price) + number(rating) + event(stock).
Query: "Best affordable laptops under $500" → retrieval balances affordability, rating, and availability.
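One way to sketch that balance in plain Python; the affordability function, catalog, and equal default weights are invented for the example.

```python
def affordability(price, budget=500.0):
    """1.0 for free, falling linearly to 0.0 at the budget; 0 beyond it."""
    return max(0.0, 1.0 - price / budget)

LAPTOPS = [
    {"id": "l1", "price": 450.0, "rating": 4.6, "in_stock": True},
    {"id": "l2", "price": 300.0, "rating": 3.2, "in_stock": True},
    {"id": "l3", "price": 480.0, "rating": 4.8, "in_stock": False},
]

def score(item, w_price=1.0, w_rating=1.0, w_stock=1.0):
    # Fuse price, rating, and availability signals with per-query weights.
    return (w_price * affordability(item["price"])
            + w_rating * item["rating"] / 5.0
            + w_stock * (1.0 if item["in_stock"] else 0.0))

best = max(LAPTOPS, key=score)
```

With these weights the much cheaper, in-stock laptop narrowly beats the better-rated one; raising `w_rating` at query time would flip that without re-indexing.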
3. Nested / Hierarchical Retrieval
Schema: doc with sections → paragraphs → sentences.
Spaces: section titles (broad), sentence text (fine-grained), recency.
Retrieve the relevant snippet inside the relevant section. Perfect for long-doc RAG.
4. Adaptive Query Understanding
Feature: .with_natural_query(...).
Example: "Show me the cheapest recent smartphones like iPhone 14".
LLM parses into:
- description_query = "iPhone 14"
- price_weight = 10
- recency_weight = 5
Retrieval adapts dynamically.
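A toy stand-in for that parsing step: in the real framework .with_natural_query(...) hands this job to an LLM, while this sketch keys off keywords just to show the parameter shape. The parameter names mirror the example above; the keyword rules are invented.

```python
def parse_query(natural_query):
    # Toy parser: a real pipeline would have an LLM produce these parameters.
    params = {
        "description_query": natural_query,
        "price_weight": 1.0,
        "recency_weight": 1.0,
    }
    if "cheapest" in natural_query.lower():
        params["price_weight"] = 10.0   # user cares a lot about price
    if "recent" in natural_query.lower():
        params["recency_weight"] = 5.0  # user cares about freshness
    return params

params = parse_query("Show me the cheapest recent smartphones like iPhone 14")
```

The extracted dict plugs straight into query-time weights, so each natural-language query effectively configures its own retrieval strategy.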
5. Multi-Modal Retrieval
Schema: manuals + product_image + numeric_specs.
Spaces: text + image + number.
User uploads a photo and asks: "Find manuals for this device under $200".
Fusion across modalities → the right match.
Key Advantages
No re-indexing needed when you change retrieval strategy.
Query-time personalization & parameter tuning.
Multi-modal fusion (text + image + numbers).
Structured + unstructured fusion in one framework.
Seamless LLM integration for natural query parsing.
Final Takeaway
Superlinked isn't just another vector search tool;
it's a composable retrieval framework for the next generation of RAG systems.
Instead of forcing one-size-fits-all embeddings, you get:
multi-space embeddings (per field)
query-time parameter control (weights, filters, recency)
fusion across modalities (text, numbers, images, time)
In other words: RAG that's smarter, faster, and way more context-aware.
Conclusion
This means Superlinked becomes your retriever layer in a RAG pipeline, replacing "vanilla vector search" with a composable, multi-signal retriever.
So, in short: Superlinked enables structured + unstructured fusion at retrieval time, which is a huge leap over vanilla RAG. You can think of it as "parametric retrieval over multiple embedding spaces", which allows things like nested retrieval, personalization, multimodal RAG, and adaptive context selection.
Superlinked gives you the ability to treat structured + unstructured fields as first-class signals and fuse them at retrieval time.
This makes it a game-changer for:
RAG pipelines
Recommendation systems
Multi-modal assistants
Dynamic search engines
If you care about relevance in AI-powered retrieval, this is technology you want to watch closely.