NLP & AI
Mar 2025
Dr. Elena Vasquez, Marcus Chen, Dr. Aris Thorne
We propose a transformer-based architecture that generates contextual definitions for morphologically complex languages using cross-lingual transfer learning. The model achieves an 18% improvement over baseline LLMs in zero-shot evaluation.
Datasets
Jan 2025
Dictionary Research Team
An open, crowdsourced alignment dataset containing semantic triples, phonetic mappings, and usage contexts. Designed to benchmark cross-lingual embedding models and support dictionary construction pipelines.
Linguistics
Nov 2024
Prof. Lin Wei, Sarah Jenkins
Longitudinal analysis of lexical meaning shifts across social media platforms (2010–2024). Reveals accelerated semantic broadening in tech-adjacent terminology and proposes a computational drift metric.
Multilingual
Sep 2024
Dr. Kofi Mensah, A. Patel
Addresses the limitations of ASCII-based phonetic transcription for tonal languages. Introduces a tone-aware mapping layer that improves pronunciation accuracy by 32% in ASR pipelines.
NLP & AI
Jul 2024
J. Roberts, Dr. Elena Vasquez
A novel approach to building dense synonym networks by distilling contextual embeddings from LLMs into lightweight graph models. Enables real-time thesaurus queries with 99.4% precision.
Datasets
May 2024
Dictionary Research Team
A structured dataset tracking lexical origins across 200 years of printed corpora. Includes proto-language roots, borrowing pathways, and semantic evolution timelines.