Extract entities, relationships, and topics from complex documents automatically. Get actionable knowledge graphs in minutes instead of weeks.
Knwler is open source — MIT license.
You can install knwler in minutes using pipx. It creates an isolated environment and makes the knwler command available globally without affecting your system Python or other projects.
After installation, run the pipeline on your documents with a single command. To process a PDF and export everything to a specific folder:
This uses Ollama with the Qwen 2.5 3B model by default. To use OpenAI, set your API key and add the --openai flag:
To see all available commands and options:
You can also install via UV or directly from GitHub for the latest features:
Enterprise-grade document intelligence
Purpose-built for organizations that need to extract structured insight from regulatory, legal, and compliance documents at scale.
The pipeline analyzes document samples and infers optimal entity and relation types. No manual ontology engineering required.
Auto-detects language and adapts all prompts. Supports English, German, French, Spanish, and Dutch out of the box.
Run fully on-premise with local LLMs. Zero data leaves your infrastructure. Meets the strictest data sovereignty requirements.
Automatically discovers clusters of related entities and assigns human-readable topic labels for instant thematic insight.
Export to JSON, GML, GraphML, and interactive HTML. Import directly into Neo4j, Gephi, yEd, Memgraph, or SurrealDB.
Process new documents and the existing graph is augmented, not rebuilt. Entity descriptions are consolidated intelligently.
From document to knowledge in three steps
A streamlined pipeline that turns unstructured text into structured, queryable knowledge.
Upload PDF, text, or Markdown documents. The system extracts and segments content into optimally sized chunks for analysis.
LLM-powered extraction identifies entities, relationships, and topics. Schema is discovered automatically or supplied by your team.
Receive an interactive knowledge graph with community detection, topic labels, and exports ready for your graph analytics platform.
Built for high-stakes document workflows
Trusted by teams where accuracy, traceability, and data privacy are non-negotiable.
Map entities and obligations across 100+ page regulatory documents. Understand cross-references and dependencies at a glance.
Extract parties, clauses, and contractual relationships from legal documents. Surface hidden connections across document sets.
Build knowledge graphs from academic papers and reports. Identify key concepts, authors, and methodological connections.
Accelerate M&A and audit processes by automatically structuring findings from financial disclosures and corporate filings.
Each example was processed in minutes with full knowledge graph output. Explore the interactive reports yourself.
All nine books of the Belgian Civil Law were extracted and consolidated via Knwler. The visualization was made with Linkurious Ogma.

The Human Rights in an advance graphviz based on yFiles.

The Deloitte 2024 Global Impact Report highlights the firm's commitment to driving inclusive and sustainable progress amidst global challenges like geopolitical tension and rapid technological change.

The NIST AI Risk Management Framework (AI RMF 1.0) provides a structured approach for organizations to identify, assess, and manage risks associated with artificial intelligence systems.

The basic version is open source. Get started today or request a demo of the full platform.