Anvia Docs

Use .extract(extractor) when a pipeline stage should convert the current text into typed data.

1. Build the Extractor

import { ExtractorBuilder } from "@anvia/core/extractor";
import { PipelineBuilder } from "@anvia/core/pipeline";
import { z } from "zod";

const triageSchema = z.object({
  priority: z.enum(["low", "medium", "high"]),
  summary: z.string(),
});

const extractor = new ExtractorBuilder(model, triageSchema)
  .instructions("Extract triage fields from the support summary.")
  .retries(1)
  .build();

2. Extract Inside the Pipeline

const pipeline = new PipelineBuilder(z.string())
  .step((ticket) => `Summarize this ticket:\n\n${ticket}`)
  .prompt(summarizer)
  .extract(extractor)
  .build();

The output type is now the schema data.

const triage = await pipeline.run("Checkout is down for enterprise users.");

console.log(triage.priority);

3. Add Application Logic After Extraction

const pipeline = new PipelineBuilder(z.string())
  .prompt(summarizer)
  .extract(extractor)
  .step((triage) => ({
    ...triage,
    route: triage.priority === "high" ? "incident" : "support",
  }))
  .build();

Use extractor steps after text generation and before business logic that needs structured fields.

Extractor Steps

1. Build the Extractor

2. Extract Inside the Pipeline

3. Add Application Logic After Extraction