The TypeScript SDK wraps your existing LLM client instances and automatically captures latency, token counts, costs, and errors for every call, with no changes to your application logic beyond initialization.
Installation
Install the package
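Assuming the package is published on npm under the name lumiqtrace:

```bash
npm install lumiqtrace
```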
Initialize LumiqTrace
Call lumiqtrace.init() once at application startup, before any LLM calls are made.
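A minimal sketch; the package import path and the apiKey option name are assumptions (the full option list follows below):

```typescript
import * as lumiqtrace from "lumiqtrace"; // import path assumed

// Initialize once, before any LLM clients are created or called.
lumiqtrace.init({
  apiKey: process.env.LUMIQTRACE_API_KEY!, // "lqt_..." key; option name assumed
});
```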
Init options
lumiqtrace.init(options) accepts the following configuration:
- API key: Your LumiqTrace API key. Must start with lqt_. Find this in your project settings.
- Environment: Label attached to every event. Use "staging" or "development" to separate traces by environment.
- storePrompts: When true, prompt text and completion text are stored alongside traces. Disabled by default for privacy. Enable only after reviewing your data retention policy.
- Sample rate: Fraction of events to send, between 0.0 and 1.0. Set to 0.1 to trace 10% of calls. Useful for high-volume production applications.
- Redacted keys: Keys whose values are redacted before any data is stored. Applied regardless of storePrompts.
- Debug logging: When true, logs internal errors to the console. Enable during integration testing.
- Batch size: Number of events to accumulate before flushing. Events are also flushed every flushInterval milliseconds.
- flushInterval: Milliseconds between automatic flushes. The SDK also flushes on process exit.
- Base URL: Override the API base URL. Use this only if you are self-hosting the ingest endpoint.
- OpenTelemetry export: When true, also exports spans to an OpenTelemetry-compatible backend. Requires otelEndpoint or a configured OTEL exporter.
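A fuller configuration sketch: storePrompts, flushInterval, and otelEndpoint are named in the descriptions above, while the remaining option names are assumptions chosen to match them:

```typescript
lumiqtrace.init({
  apiKey: process.env.LUMIQTRACE_API_KEY!, // option name assumed
  environment: "staging",                  // option name assumed
  storePrompts: false,          // keep prompt/completion text out of storage
  sampleRate: 0.1,              // assumed name: trace 10% of calls
  redactKeys: ["email", "ssn"], // assumed name: redacted regardless of storePrompts
  debug: true,                  // assumed name: log internal SDK errors while integrating
  batchSize: 50,                // assumed name: events accumulated before a flush
  flushInterval: 5000,          // flush every 5 seconds; also flushed on process exit
});
```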
Wrapping providers

OpenAI
wrapOpenAI(client) patches client.chat.completions.create for both streaming and non-streaming calls. Cached token counts are extracted from prompt_tokens_details.cached_tokens.
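For example (the wrapOpenAI import path is an assumption):

```typescript
import OpenAI from "openai";
import { wrapOpenAI } from "lumiqtrace"; // import path assumed

// Wrap once; the wrapped client is used exactly like the original.
const openai = wrapOpenAI(new OpenAI());

const completion = await openai.chat.completions.create({
  model: "gpt-4o-mini",
  messages: [{ role: "user", content: "Hello" }],
});
```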
TTFT is measured from the call start to the first non-empty chunk and appears as ttft_ms in your LumiqTrace dashboard.

Anthropic
wrapAnthropic(client) patches client.messages.create. Token fields are extracted from result.usage.input_tokens, output_tokens, and cache_read_input_tokens.
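A sketch along the same lines (import path assumed):

```typescript
import Anthropic from "@anthropic-ai/sdk";
import { wrapAnthropic } from "lumiqtrace"; // import path assumed

const anthropic = wrapAnthropic(new Anthropic());

const message = await anthropic.messages.create({
  model: "claude-3-5-sonnet-latest",
  max_tokens: 1024,
  messages: [{ role: "user", content: "Hello" }],
});
```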
Prompt cache read tokens (cache_read_input_tokens) are tracked separately and appear as cached_tokens in traces. This lets you see the real savings from Anthropic’s prompt caching.

Google Generative AI
wrapGoogle(client) patches the getGenerativeModel() method and captures token counts from response.usageMetadata.
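For example (the wrapGoogle import path is an assumption):

```typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import { wrapGoogle } from "lumiqtrace"; // import path assumed

const genAI = wrapGoogle(new GoogleGenerativeAI(process.env.GOOGLE_API_KEY!));
const model = genAI.getGenerativeModel({ model: "gemini-1.5-flash" });

const result = await model.generateContent("Hello");
```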
OpenRouter
wrapOpenRouter(client) wraps OpenRouter clients, which share the OpenAI SDK interface.
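Since OpenRouter speaks the OpenAI wire protocol, the OpenAI SDK is pointed at the OpenRouter base URL before wrapping (import path assumed):

```typescript
import OpenAI from "openai";
import { wrapOpenRouter } from "lumiqtrace"; // import path assumed

const openrouter = wrapOpenRouter(
  new OpenAI({
    baseURL: "https://openrouter.ai/api/v1",
    apiKey: process.env.OPENROUTER_API_KEY,
  })
);
```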
AWS Bedrock
wrapBedrock(client) patches the converse method on an AWS Bedrock runtime client. Token counts are extracted from result.usage.inputTokens, outputTokens, and cacheReadInputTokens.
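A sketch using the aggregated BedrockRuntime client, which exposes converse() directly (the wrapBedrock import path is an assumption):

```typescript
import { BedrockRuntime } from "@aws-sdk/client-bedrock-runtime";
import { wrapBedrock } from "lumiqtrace"; // import path assumed

const bedrock = wrapBedrock(new BedrockRuntime({ region: "us-east-1" }));

const result = await bedrock.converse({
  modelId: "anthropic.claude-3-5-sonnet-20240620-v1:0",
  messages: [{ role: "user", content: [{ text: "Hello" }] }],
});
```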
Note that the converseStream method is not currently wrapped, so streaming Bedrock calls are not traced.

Mistral
wrapMistral(client) patches client.chat.complete. Token counts are extracted from result.usage.promptTokens and completionTokens.
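For example (import path assumed):

```typescript
import { Mistral } from "@mistralai/mistralai";
import { wrapMistral } from "lumiqtrace"; // import path assumed

const mistral = wrapMistral(new Mistral({ apiKey: process.env.MISTRAL_API_KEY }));

const res = await mistral.chat.complete({
  model: "mistral-small-latest",
  messages: [{ role: "user", content: "Hello" }],
});
```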
Groq
wrapGroq(client) patches client.chat.completions.create. Token counts are extracted from result.usage.prompt_tokens and completion_tokens.
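For example (import path assumed):

```typescript
import Groq from "groq-sdk";
import { wrapGroq } from "lumiqtrace"; // import path assumed

const groq = wrapGroq(new Groq());

const chat = await groq.chat.completions.create({
  model: "llama-3.1-8b-instant",
  messages: [{ role: "user", content: "Hello" }],
});
```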
LiteLLM
wrapLiteLLM(options) creates an OpenAI-compatible client pointed at your LiteLLM proxy and wraps it with the OpenAI wrapper. Use this when you route multiple providers through a LiteLLM gateway.
wrapLiteLLM(options) accepts:

- Base URL: The base URL of your LiteLLM proxy server.
- API key: Optional API key for your LiteLLM proxy. Defaults to "litellm" if omitted.
Context enrichment

Use withLumiqtraceContext to attach a userId, sessionId, or custom tags to all traces generated within a function scope. This is useful for associating LLM calls with a specific user session or request.
withLumiqTrace is an alias for withLumiqtraceContext and behaves identically.
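A sketch of scoped enrichment; the exact signature is an assumption (a context object plus a callback whose LLM calls inherit the attached fields):

```typescript
import OpenAI from "openai";
import { wrapOpenAI, withLumiqtraceContext } from "lumiqtrace"; // import path assumed

const openai = wrapOpenAI(new OpenAI());

// Signature assumed: every wrapped-client call inside the callback
// carries the userId, sessionId, and tags attached here.
const reply = await withLumiqtraceContext(
  { userId: "user_123", sessionId: "sess_abc", tags: { plan: "pro" } },
  async () =>
    openai.chat.completions.create({
      model: "gpt-4o-mini",
      messages: [{ role: "user", content: "Hello" }],
    })
);
```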