News

Research releases, papers, awards, and milestones.

2026

4 updates
Mar 20
🚀 Released Nemotron-Cascade 2, a compact open 30B MoE with Gold Medal-level performance on the IMO, IOI, and ICPC World Finals, while using 20x fewer parameters than frontier open models. Model and data are available on Hugging Face.
Mar 11
🚀 Released Nemotron 3 Super, an open and efficient hybrid Mamba-Transformer MoE built for strong agentic reasoning.
Feb 9
🚀 Released OpenResearcher, a fully open pipeline for synthesizing long-horizon deep research trajectories with open data, models, and demo.

2025

11 updates
Dec 12
🎉 StructEval was accepted to TMLR, received the Journal-to-Conference Certificate, and will be presented at ICLR 2026.
Dec 1
🏆 Mantis received the TMLR 2025 Outstanding Paper Award.
Sep 26
🎬 Released VideoScore2, a multi-dimensional and interpretable evaluator for generative videos with detailed reasoning traces.
Sep 1
📄 Released the VerlTool technical report, highlighting a unified ARLT framework with async execution, multi-tool support, and competitive results across 6 domains.
Jun 1
🚀 Released the VerlTool codebase, a unified framework for training tool-using language agents with async rollouts and modular tool APIs.
May 20
🚀 Released General-Reasoner, extending RL-style reasoning beyond math and code with verified web-scale data and a generative answer verifier.
Mar 8
Will join NVIDIA as a research intern in Santa Clara this summer.

2024

14 updates
Jun 24
Started my internship at AI2 today!
Jun 18
I arrived in Seattle for the CVPR 2024 conference.
Apr 14
We release Mantis Mantis, enhancing LMM with Interleaved Multi-Image Instruction Tuning!
Apr 8
WildVision Arena has been accepted to CVPR 2024 demo track and will be presented at the conference!
Apr 5
Description of the image MMMU is accepted to CVPR 2024 oral presentation!
Feb 20
I am excited to announce that I will be joining AI2 Mosaic Team as a research intern this summer!
Feb 12
GenAI arena is now also online! You can test popular image generation/editing models here!

2023

5 updates
Dec 1
We release PairRM-0.4B 🤗 based on LLM-Blender!
Nov 29
We release the new benchmark Description of the image MMMU for evaluating multi-modal models!
Oct 4
Check my first work at UW: 🐯TIGERScore!
Sep 2
I arrived at University of Waterloo and started my Ph.D. journey!
Jun 5
We release LLM-Blender! It’s accepted to ACL 2023!