Publications

(2025). Distilling Specialized Orders for Visual Generation. arXiv preprint arXiv:2504.17069.
(2025). BigDocs: An Open Dataset for Training Multimodal Models on Document and Code Tasks. International Conference on Learning Representations (ICLR).
(2025). BigCharts-R1: Enhanced Chart Reasoning with Visual Reinforcement Finetuning. Conference on Language Modeling (COLM).
(2025). AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Document Understanding. Advances in Neural Information Processing Systems (NeurIPS).
(2025). AgentAda: Skill-Adaptive Data Analytics for Tailored Insight Discovery. Workshop at Association for Computational Linguistics (ACL).
(2024). Improved Training Set Selection for Semi-Supervised Learning. US Patent App. 18/336,511.
(2024). XC-Cache: Cross-Attending to Cached Context for Efficient LLM Inference. Advances in Neural Information Processing Systems (NeurIPS).
(2024). WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? International Conference on Machine Learning (ICML).
(2024). Towards Good Validation Metrics for Generative Models in Offline Model-Based Optimisation. Transactions on Machine Learning Research (TMLR).
(2024). RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content. Advances in Neural Information Processing Systems (NeurIPS).