Open Data & Transparency Portal

Access, analyze, and integrate Aevum's verified knowledge graphs, multilingual corpora, and editorial datasets. Open by default, versioned for reproducibility, and optimized for researchers and developers.

Public Datasets: 48
Knowledge Edges: 2.4M
Languages: 140+
Current Schema: v4.2.1

Available Datasets

Knowledge Graph v4.2.1

Global Knowledge Graph

Entity-relation triples spanning science, history, culture, and technology. Includes confidence scores and source provenance.

Format: JSON-LD, RDF, CSV
Size: 4.2 GB
Records: 2.4M
Updated: Daily
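
To illustrate how the confidence scores might be used, the sketch below filters triples from a CSV export by a minimum confidence. The column names (subject, predicate, object, confidence) and the sample rows are assumptions made for illustration only; check the schema shipped with the release you download.

```python
import csv
import io

def filter_triples(csv_text, min_conf):
    """Return (subject, predicate, object) tuples with confidence >= min_conf."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        (row["subject"], row["predicate"], row["object"])
        for row in reader
        if float(row["confidence"]) >= min_conf
    ]

# Made-up sample rows with hypothetical column names, not real Aevum data.
SAMPLE = """subject,predicate,object,confidence
ex:A,ex:relatedTo,ex:B,0.99
ex:A,ex:relatedTo,ex:C,0.41
"""

high_conf = filter_triples(SAMPLE, 0.92)
```

For a multi-gigabyte export, the same filter would normally be applied while streaming the file row by row rather than loading it into a string first.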
Multilingual Corpus v3.8.0

Encyclopedia Text Corpus

Pre-processed, tokenized, and aligned article text across 140+ languages. Optimized for NLP training and cross-lingual analysis.

Format: JSONL, Parquet
Size: 18.7 GB
Records: 2.1M
Updated: Weekly
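
A JSONL release can be processed one record per line. The sketch below counts records per language; the field names "lang" and "text" are hypothetical stand-ins, since the actual record layout is not described here.

```python
import json
from collections import Counter

def count_by_language(lines):
    """Count JSONL records per language code, skipping blank lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        counts[record["lang"]] += 1  # "lang" is an assumed field name
    return counts

# Made-up records for illustration only.
sample_lines = [
    '{"lang": "en", "text": "Example article text."}',
    '{"lang": "de", "text": "Beispieltext."}',
    '{"lang": "en", "text": "Another example."}',
]
counts = count_by_language(sample_lines)
```

Because the function accepts any iterable of lines, an open file handle can be passed in directly, so the corpus never has to fit in memory.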
Editorial Metadata v2.1.4

Peer Review & Audit Logs

Transparent editorial workflow data: revision histories, reviewer annotations, fact-check timestamps, and consensus metrics.

Format: CSV, XML
Size: 840 MB
Records: 6.8M
Updated: Real-time
Taxonomy v5.0.2

Unified Subject Ontology

Hierarchical classification system mapping disciplines, sub-fields, and cross-disciplinary tags with semantic relationships.

Format: OWL, JSON
Size: 120 MB
Records: 145K
Updated: Monthly

Data Quality & Methodology

🔍

Multi-Source Verification

Every data point is cross-referenced against peer-reviewed literature, primary archives, and institutional databases before ingestion.

🔄

Semantic Versioning

All datasets follow SemVer. Breaking schema changes trigger major version bumps, with full migration guides and diff reports.
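
Under SemVer, a pipeline can decide whether an upgrade needs a migration by comparing major versions. The helper names below are illustrative, not part of any Aevum tooling, and the sketch assumes plain MAJOR.MINOR.PATCH strings with an optional leading "v".

```python
def parse_semver(version):
    """Split a string such as 'v4.2.1' into (major, minor, patch) integers."""
    # Drop the leading "v" and any pre-release or build suffix.
    core = version.lstrip("v").split("-")[0].split("+")[0]
    major, minor, patch = (int(part) for part in core.split("."))
    return major, minor, patch

def is_breaking_upgrade(current, candidate):
    """True when the candidate has a higher major version than the current one."""
    return parse_semver(candidate)[0] > parse_semver(current)[0]

needs_migration = is_breaking_upgrade("v4.2.1", "v5.0.0")
```

Pinning a dataset to a major version and running this check before each refresh is one way to avoid picking up a breaking schema change unnoticed.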

⏱️

Predictable Update Cadence

Incremental patches ship daily, minor releases bi-weekly, and major structural updates quarterly. Full changelogs are available via Git.

🛡️

Reproducibility First

Every dataset includes pipeline configs, seed hashes, and environment manifests to guarantee exact replication of results.

Developer API & Integration

Access the full knowledge graph programmatically. The free tier is rate-limited to 1,000 req/min; webhooks and batch export options are available on the enterprise tier.

# Query entities with confidence filtering
curl -X GET \
  "https://api.aevum.edu/v4/entities?q=quantum+entanglement&min_conf=0.92" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Accept: application/ld+json"
GET   /v4/entities            Search & filter knowledge nodes
GET   /v4/graph/triples       Export relation triples
POST  /v4/batch/export        Asynchronous dataset generation
GET   /v4/metadata/versions   Dataset version history & diffs
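
The same entity query can be prepared from Python using only the standard library. This is a minimal sketch: the helper name build_entity_request is illustrative, the parameters mirror the curl example, and the response shape is not assumed, so the request is built here but not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

BASE_URL = "https://api.aevum.edu/v4"

def build_entity_request(query, min_conf, api_key):
    """Build a GET request for /v4/entities with a confidence filter."""
    params = urlencode({"q": query, "min_conf": min_conf})
    return Request(
        f"{BASE_URL}/entities?{params}",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Accept": "application/ld+json",
        },
        method="GET",
    )

req = build_entity_request("quantum entanglement", 0.92, "YOUR_API_KEY")
# To send it, pass req to urllib.request.urlopen and decode the JSON-LD body.
```

Keeping request construction separate from sending makes the URL and headers easy to inspect or unit-test without network access.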

API Tiers & Limits

Free Tier: 1,000 req/min
Academic: 10,000 req/min + bulk export
Enterprise: Unlimited + SLA support
Auth: OAuth 2.0 / API Keys
Formats: JSON-LD, CSV, Parquet, RDF
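
A client can stay under its tier limit by throttling itself. The sketch below is a simple sliding-window limiter sized for the free tier; how the server actually counts requests is not documented here, so treat the 60-second window as an assumption. The class name is illustrative.

```python
import time
from collections import deque

class SlidingWindowLimiter:
    """Client-side limiter: at most max_requests per window_seconds."""

    def __init__(self, max_requests=1000, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock      # injectable so the limiter can be tested
        self.sent = deque()     # timestamps of requests inside the window

    def wait_time(self):
        """Seconds to wait before the next request is allowed (0.0 if allowed now)."""
        now = self.clock()
        # Forget requests that have aged out of the window.
        while self.sent and now - self.sent[0] >= self.window_seconds:
            self.sent.popleft()
        if len(self.sent) < self.max_requests:
            return 0.0
        return self.window_seconds - (now - self.sent[0])

    def record(self):
        """Note that a request was just sent."""
        self.sent.append(self.clock())
```

A typical loop would call time.sleep(limiter.wait_time()) before each request and limiter.record() right after sending it.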
📜 CC BY-SA 4.0

All datasets, knowledge graphs, and textual corpora are released under the Creative Commons Attribution-ShareAlike 4.0 International License. You are free to share and adapt, provided you give appropriate credit and distribute derivatives under the same license.

GDPR Compliant
No PII Included
Open Standards (RFC 8259, RDF 1.1)
Audit-Ready Provenance