Provider Configuration Guide¶
Configure Hyper-Extract to work with OpenAI, Bailian (Alibaba Cloud), or local vLLM deployments.
Unified Example¶
All three platforms run the same extraction task below. Only the client setup (first 3 lines) changes.
OpenAI¶
from hyperextract import create_client, AutoGraph
llm, emb = create_client("openai", api_key="sk-xxx")
Alibaba Bailian¶
from hyperextract import create_client, AutoGraph
llm, emb = create_client("bailian", api_key="sk-xxx")
# Or override model: create_client("bailian:qwen3.6-plus", api_key="sk-xxx")
Local vLLM¶
from hyperextract import create_client, AutoGraph
llm, emb = create_client(
llm="vllm:Qwen3.5-9B@http://localhost:8000/v1",
embedder="vllm:bge-m3@http://localhost:8001/v1",
api_key="dummy",
)
Extraction Task (same for all)¶
graph = AutoGraph(
instruction="Extract people and their relationships",
llm_client=llm,
embedder=emb,
node_key_extractor=lambda n: n.name,
edge_key_extractor=lambda e: (e.source, e.target, e.type),
nodes_in_edge_extractor=lambda e: (e.source, e.target),
)
text = "Zhang San founded ByteDance. Li Si serves as CEO."
graph.parse(text)
print(f"Nodes: {len(graph.nodes)}, Edges: {len(graph.edges)}")
CLI Equivalents¶
| Platform | Command |
|---|---|
| OpenAI | he config init -p openai -k sk-xxx |
| Bailian | he config init -p bailian -k sk-xxx |
| vLLM | he config init → select "local vLLM" |
| Mixed (LLM=Bailian, Embedder=vLLM) | he config llm -p bailian -k sk-xxx + he config embedder -p vllm -u http://localhost:8001/v1 -k dummy |
String Shorthand Format¶
create_client() supports a compact string syntax for quick configuration:
| Format | Example | Result |
|---|---|---|
provider |
"bailian" |
Uses preset defaults for LLM + embedder |
provider:model |
"bailian:qwen3.6-plus" |
Overrides LLM model, keeps preset embedder |
provider:model@url |
"vllm:Qwen3.5-9B@localhost:8000/v1" |
Full manual specification |
Using Config File Instead¶
If you prefer file-based configuration, run he config init once and then use Template.create() or get_client():
from hyperextract import get_client, AutoGraph
llm, emb = get_client() # Reads ~/.he/config.toml
graph = AutoGraph(..., llm_client=llm, embedder=emb)
See Also¶
- Provider System & Model Compatibility — Full compatibility table and vLLM deployment guide
- CLI Configuration Reference — Complete
he configcommand reference