Document understanding and generation grounded in verifiable rewards — OCR, vision-language models and retrieval.
- Reinforcement learning for document generation. Recast office document generation with verifiable rewards — executable checks replacing an LLM-as-judge — and used them to build a token-efficient OOXML library that beats the OpenAI and Anthropic document skills at ~half the tokens while scoring 40% higher on reward.
- Retrieval & document understanding. Built OCR / VLM / retrieval pipelines for large-scale enterprise document workflows, and showed BM25 rivals multimodal retrievers in visually-rich RAG [ICLR Workshop], when OCR beats VLMs, and vice versa (DISCO) [ICLR Workshop], plus corpus-level QA and failure attribution.