Document intelligence at scale — OCR, vision-language models, retrieval.
- Document intelligence pipelines combining OCR, vision-language models (VLMs) and retrieval for large-scale enterprise workflows.
- Token-efficient OOXML parsing and generation framework, enabling structured document understanding and automated document creation.
- Two ICLR 2026 workshop papers on retrieval and OCR/VLM benchmarking, and on failure attribution in document understanding systems.