The lumiq-fastapi package adds ASGI middleware to your FastAPI application that automatically creates a trace context for every incoming HTTP request. You can then use the @observe_llm and @observe_span decorators to attach LLM call details and custom spans to that context without passing trace objects through your call stack.

Installation

pip install lumiq-fastapi

lumiq-fastapi requires httpx for flushing traces. If it is not already in your environment, install it alongside the package:

pip install lumiq-fastapi httpx

Adding the middleware

Import LumiqFastAPIMiddleware and LumiqConfig, then register the middleware with your FastAPI app using add_middleware.
from fastapi import FastAPI
from lumiq_fastapi import LumiqFastAPIMiddleware, LumiqConfig

app = FastAPI()

config = LumiqConfig(api_key="lqt_your_api_key_here")
app.add_middleware(LumiqFastAPIMiddleware, config=config)
Once registered, every HTTP request to your app creates a trace context. The middleware records the HTTP method, path, response status, and total request duration, then flushes the data to LumiqTrace in the background.
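
Run the app as usual; no other changes are needed. For example, if the snippet above lives in main.py:

uvicorn main:app --reload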

LumiqConfig options

api_key (string, required)
Your LumiqTrace API key. Must start with lqt_.

base_url (string, default: "https://api.lumiqtrace.com")
Override the LumiqTrace API base URL. Use this only if you are self-hosting the ingest endpoint.

enabled (boolean, default: True)
When False, the middleware passes all requests through without tracing. Useful for disabling tracing in test environments.

sample_rate (float, default: 1.0)
Fraction of requests to trace, between 0.0 and 1.0. Set to 0.1 to trace 10% of requests in high-traffic environments.

capture_request_body (boolean, default: False)
When True, the request body is captured and attached to the trace. Enable only after reviewing your data retention policy; request bodies may contain sensitive data.

capture_response_body (boolean, default: False)
When True, the response body is captured and attached to the trace.

redact_fields (list[str], default: ["password", "token", "api_key"])
Field names whose values are redacted before any data is stored. Applied regardless of capture_request_body.

debug (boolean, default: False)
When True, flush errors are logged to stdout. Enable during integration testing.
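
Putting several options together, a configuration for a high-traffic production service might look like this (the values are illustrative):

from lumiq_fastapi import LumiqConfig

config = LumiqConfig(
    api_key="lqt_your_api_key_here",
    sample_rate=0.1,  # trace 10% of requests
    capture_request_body=False,  # keep request bodies out of traces
    redact_fields=["password", "token", "api_key", "authorization"],
    debug=False,
)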

Tracing LLM calls with @observe_llm

Use the @observe_llm decorator on any async function that makes an LLM call. The decorator captures the model name, provider, duration, and token usage (extracted automatically from the response object if it has a .usage attribute).
from fastapi import FastAPI
from lumiq_fastapi import LumiqFastAPIMiddleware, LumiqConfig, observe_llm
import openai

app = FastAPI()
app.add_middleware(LumiqFastAPIMiddleware, config=LumiqConfig(api_key="lqt_your_api_key_here"))

client = openai.AsyncOpenAI()

@observe_llm(model="gpt-4o", provider="openai")
async def generate_summary(text: str) -> str:
    response = await client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": "Summarize the following text concisely."},
            {"role": "user", "content": text},
        ],
    )
    return response.choices[0].message.content

@app.post("/summarize")
async def summarize_endpoint(body: dict):
    summary = await generate_summary(body["text"])
    return {"summary": summary}

@observe_llm parameters:

model (string)
The model identifier to record on the span. Example: "gpt-4o", "claude-sonnet-4-6".

provider (string)
The provider name to record on the span. Example: "openai", "anthropic".
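
The decorator is not tied to any one SDK. A sketch of recording a Claude call, assuming the anthropic package is installed (draft_reply and its prompt wiring are illustrative):

import anthropic
from lumiq_fastapi import observe_llm

client = anthropic.AsyncAnthropic()

@observe_llm(model="claude-sonnet-4-6", provider="anthropic")
async def draft_reply(prompt: str) -> str:
    response = await client.messages.create(
        model="claude-sonnet-4-6",
        max_tokens=1024,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text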

Tracing custom operations with @observe_span

Use @observe_span to instrument any async function as a named span. The decorator records the span name you supply, the function's duration, and its status (success or error) as attributes on the current request's trace context.
from lumiq_fastapi import observe_span

@observe_span("document-retrieval", span_type="retriever")
async def retrieve_documents(query: str) -> list[str]:
    # Fetch relevant documents from your vector store
    results = await vector_store.search(query, top_k=5)
    return [doc.content for doc in results]

@observe_span("rag-pipeline", span_type="custom")
async def run_rag(query: str) -> str:
    docs = await retrieve_documents(query)
    summary = await generate_summary("\n\n".join(docs))
    return summary

@observe_span parameters:

name (string, required)
The span name. Appears as the operation label in the trace view.

span_type (string, default: "custom")
A hint for the span type. Common values: "llm", "retriever", "custom".
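
When a decorated function raises, the span is recorded with an error status, and the exception presumably propagates to the caller as usual (an assumption based on the success-or-error behavior described above; parse_upload is illustrative):

import json

from lumiq_fastapi import observe_span

@observe_span("parse-upload", span_type="custom")
async def parse_upload(raw: bytes) -> dict:
    # If json.loads raises here, the span's status is recorded as error.
    return json.loads(raw)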

Setting custom attributes

Use set_attribute to attach arbitrary key-value data to the current request’s trace context. Call it from anywhere in your request handler — the attribute is included in the trace when it flushes.
from lumiq_fastapi import set_attribute

@app.post("/chat")
async def chat_endpoint(body: dict):
    user_id = body.get("user_id")
    set_attribute("user.id", user_id)
    set_attribute("feature", "chat")

    # generate_response stands in for your own LLM call, e.g. an @observe_llm function
    response = await generate_response(body["message"])
    return {"response": response}

Getting the current trace ID

Use get_current_trace_id() to retrieve the trace ID for the current request. This is useful for logging or for correlating LumiqTrace data with your own observability stack.
from lumiq_fastapi import get_current_trace_id
import logging

logger = logging.getLogger(__name__)

@app.post("/generate")
async def generate_endpoint(body: dict):
    trace_id = get_current_trace_id()
    logger.info("Handling request", extra={"lumiqtrace_id": trace_id})

    result = await generate_summary(body["text"])
    return {"result": result, "trace_id": trace_id}

Complete example

This example shows a FastAPI app with middleware, an LLM call, a retrieval step, and custom attributes all wired together.
from fastapi import FastAPI
from lumiq_fastapi import (
    LumiqFastAPIMiddleware,
    LumiqConfig,
    observe_llm,
    observe_span,
    set_attribute,
)
import openai

app = FastAPI()
app.add_middleware(
    LumiqFastAPIMiddleware,
    config=LumiqConfig(
        api_key="lqt_your_api_key_here",
        environment="production",
        sample_rate=1.0,
    ),
)

openai_client = openai.AsyncOpenAI()

@observe_span("vector-search", span_type="retriever")
async def search_docs(query: str) -> list[str]:
    # Replace with your actual vector store call
    return ["Relevant document 1", "Relevant document 2"]

@observe_llm(model="gpt-4o", provider="openai")
async def answer_question(question: str, context: str) -> str:
    response = await openai_client.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": f"Answer using this context:\n{context}"},
            {"role": "user", "content": question},
        ],
    )
    return response.choices[0].message.content

@app.post("/ask")
async def ask_endpoint(body: dict):
    user_id = body.get("user_id", "anonymous")
    set_attribute("user.id", user_id)

    docs = await search_docs(body["question"])
    answer = await answer_question(body["question"], "\n".join(docs))

    return {"answer": answer}

The middleware flushes traces asynchronously in batches of 10 or every 5 seconds, whichever comes first. In short-lived processes or test environments, explicitly await the middleware instance's _flush() method before the process exits so that all traces are sent; one way to obtain that instance is sketched below.
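
Because add_middleware constructs the middleware instance inside the ASGI stack, you need your own reference to call _flush(). A minimal sketch, assuming the constructor follows the standard ASGI middleware signature of an app plus keyword options (which the add_middleware call above implies):

from fastapi import FastAPI
from lumiq_fastapi import LumiqFastAPIMiddleware, LumiqConfig

inner = FastAPI()
# ... register routes on `inner` as usual ...

# Wrap the app yourself instead of calling add_middleware, keeping
# a handle on the instance so it can be flushed explicitly.
app = LumiqFastAPIMiddleware(inner, config=LumiqConfig(api_key="lqt_your_api_key_here"))

# At the end of a test session or short-lived script:
#     await app._flush()

Serve the wrapped instance with your ASGI server exactly as you would serve the FastAPI app directly.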