Generated: 2026-03-09 19:07
Document: HumanRights.pdf
Runs: 7
Summary
Pipelines Evaluated
7
anthropic, ollama, openai
Best KYS
0.904
claude-sonnet-4-6 / claude-haiku-4-5-20251001
Most Knowledge
174
entities + relations
Fastest Pipeline
85.7s
end-to-end wall time
Avg Knowledge Rate
0.70
nodes+edges / second
Total Entities (all)
482
across all pipelines
Leaderboard
Sorted by Knowledge Yield Score (KYS) — higher is better
# Backend Discovery Model Extraction Model KYS ↓ Entities Relations Graph Size Knowledge Rate Total Time Quality Score Speed Score
1 anthropic claude-sonnet-4-6 claude-haiku-4-5-20251001
0.904
87 73 160 1.66 /s 96.5s 0.919 0.888
2 openai gpt-4o gpt-4o-mini
0.754
47 52 99 1.16 /s 85.7s 0.569 1.000
3 ollama llama3.2:latest llama3.2:latest
0.581
88 86 174 0.68 /s 254.1s 1.000 0.337
4 ollama gemma3:12b gemma3:4b
0.580
94 34 128 0.68 /s 187.2s 0.736 0.457
5 ollama llama3.1:8b llama3.1:8b
0.456
93 80 173 0.42 /s 410.2s 0.994 0.209
6 ollama mistral:latest mistral:latest
0.279
46 35 81 0.16 /s 510.6s 0.466 0.168
7 ollama qwen2.5:14b qwen2.5:3b
0.261
27 28 55 0.14 /s 398.0s 0.316 0.215
Visual Analysis

Knowledge Yield Score (KYS)

Pipeline Time Breakdown (seconds)

Knowledge Graph Output

Knowledge Rate (graph elements / second)

Pipeline Details
claude-sonnet-4-6
extraction: claude-haiku-4-5-20251001
anthropic
#1
0.904
Knowledge Yield Score
Entities
87
Relations
73
Total Time
96.5s
Knowledge Rate
1.66/s
Schema
7.6s
Rephrase
23.6s
Extraction
60.1s
Consolidation
5.1s
gpt-4o
extraction: gpt-4o-mini
openai
#2
0.754
Knowledge Yield Score
Entities
47
Relations
52
Total Time
85.7s
Knowledge Rate
1.16/s
Schema
4.8s
Rephrase
48.3s
Extraction
26.8s
Consolidation
5.8s
llama3.2:latest
extraction: llama3.2:latest
ollama
#3
0.581
Knowledge Yield Score
Entities
88
Relations
86
Total Time
254.1s
Knowledge Rate
0.68/s
Schema
10.0s
Rephrase
35.8s
Extraction
204.9s
Consolidation
3.4s
gemma3:12b
extraction: gemma3:4b
ollama
#4
0.580
Knowledge Yield Score
Entities
94
Relations
34
Total Time
187.2s
Knowledge Rate
0.68/s
Schema
25.5s
Rephrase
36.3s
Extraction
120.8s
Consolidation
4.6s
llama3.1:8b
extraction: llama3.1:8b
ollama
#5
0.456
Knowledge Yield Score
Entities
93
Relations
80
Total Time
410.2s
Knowledge Rate
0.42/s
Schema
17.7s
Rephrase
36.8s
Extraction
342.0s
Consolidation
13.7s
mistral:latest
extraction: mistral:latest
ollama
#6
0.279
Knowledge Yield Score
Entities
46
Relations
35
Total Time
510.6s
Knowledge Rate
0.16/s
Schema
25.4s
Rephrase
49.8s
Extraction
406.2s
Consolidation
29.2s
qwen2.5:14b
extraction: qwen2.5:3b
ollama
#7
0.261
Knowledge Yield Score
Entities
27
Relations
28
Total Time
398.0s
Knowledge Rate
0.14/s
Schema
25.7s
Rephrase
37.8s
Extraction
333.3s
Consolidation
1.2s
Methodology – Knowledge Yield Score (KYS)

The Knowledge Yield Score (KYS) is a composite metric that balances output richness (how much the pipeline extracted) against efficiency (how fast it ran). It is the geometric mean of two normalized sub-scores, analogous to the F₁ score — it penalises runs that excel in only one dimension.

graph_size = num_entities + num_relations
total_time = schema_time + rephrase_time + extraction_time + consolidation_time

quality_norm = graph_size / max(graph_size)  # [0, 1]
speed_norm = min(total_time) / total_time  # [0, 1] — fastest run scores 1.0

KYS = √(quality_norm × speed_norm)  # geometric mean

quality_norm rewards pipelines that produce large, dense knowledge graphs. speed_norm rewards pipelines that finish quickly; the fastest run scores 1.0 and slower runs are penalised proportionally. The geometric mean ensures a pipeline cannot compensate for poor speed with high quality alone — both dimensions must be strong for a high KYS.

Knowledge Rate (graph_size / total_time) is a complementary raw throughput metric expressed in graph elements per second, useful for absolute comparisons independent of the normalization range.