> ## Documentation Index
> Fetch the complete documentation index at: https://explore.airia.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Context Engineering

> Design and optimize the information pipeline that feeds your AI agents — from ingestion to enrichment to retrieval.

Context engineering is the discipline of designing and optimizing the information pipeline that feeds your AI agents. It encompasses everything from how data enters your system, how it gets enriched with structure and meaning, and how the right pieces are retrieved at query time to produce accurate, grounded responses.

Airia provides a complete context engineering pipeline — from connecting your enterprise data to delivering precisely the right context to any LLM, through any interface.

## The Airia Context Pipeline

Every piece of knowledge your agents use flows through four stages:

```
Connect → Process → Enrich → Retrieve
```

**Connect** — Bring your data in from 20+ enterprise sources (SharePoint, Google Drive, Confluence, S3, and more) with real-time sync and permission enforcement.

**Process** — Documents are parsed, chunked, and embedded into vector representations. Choose your PDF parser, enable image scanning, select your embedding model, and configure your vector database.

**Enrich** — Optionally extract structured knowledge from your documents. Knowledge Graph Extraction identifies entities and relationships, creating a graph layer on top of your vector store that dramatically improves retrieval quality for complex queries.

**Retrieve** — Search your knowledge base using semantic search, keyword search, hybrid combinations, or agentic multi-hop retrieval via MCP. Add reranking for precision. Let your agents decide dynamically what to search, when, and how many times.

## Choosing a Retrieval Pattern

Airia supports three retrieval patterns. Choose based on your use case:

| Pattern                     | How it works                                                                                                                                                            | Best for                                                                                       |
| --------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------------------- |
| **Data Search Step**        | Single-hop, embedding-based search. The full user query is used to find matching chunks in one pass.                                                                    | Simple Q\&A, batch processing, predictable queries. Fast and low-cost.                         |
| **MCP Multi-Hop Retrieval** | Multi-hop agentic retrieval via the Airia Datasource MCP Server. The LLM autonomously decides which sources to query, which tools to use, and how many searches to run. | Complex questions, conversational agents, multi-source reasoning, accuracy-critical workflows. |
| **Text-to-SQL**             | Translates natural language into SQL queries against structured data (CSV, XLSX).                                                                                       | Numerical analysis, tabular data, structured reporting.                                        |

These patterns can be combined in a single agent. For example, an agent might use a Data Search Step for fast initial lookup and a MCP Multi-Hop Retrieval for deeper follow-up reasoning.

## What's in This Section

| Page                                                                            | What you'll learn                                                                  |
| ------------------------------------------------------------------------------- | ---------------------------------------------------------------------------------- |
| [Connecting Data Sources](/integrations/Data-Source-Connectors/data-sources)    | How to connect enterprise sources, supported formats, sync scheduling, permissions |
| [Ingestion Settings](/integrations/Data-Source-Connectors/Ingestion%20settings) | PDF parsers, image scanning, embedding models, vector database configuration       |
| [Knowledge Graph Extraction](/context-engineering/knowledge-graph-extraction)   | Industry presets, custom entity types, how Graph RAG works                         |
| [Custom Knowledge Graphs](/context-engineering/custom-knowledge-graphs)         | Building runtime graphs with Cypher queries                                        |
| [Retrieval Methods](/context-engineering/retrieval-methods)                     | Data Search Step, MCP Multi-Hop Retrieval (MCP), Text-to-SQL, configuration        |
| [Hybrid Search and Reranking](/context-engineering/hybrid-search-reranking)     | Semantic vs keyword search, fusion algorithms, reranker models                     |
| [Graph-Enhanced Retrieval](/context-engineering/graph-enhanced-retrieval)       | How knowledge graphs boost retrieval quality                                       |

### Guides

* [Quick Start: Ingest and Search Your First Data Source](/context-engineering/guides/quick-start)
* [Set Up Graph RAG for Your Industry](/context-engineering/guides/graph-rag)
* [Connect Airia Retrieval to External Agents via MCP](/context-engineering/guides/mcp-retrieval)