Nishit Neema
Applied ML @ Cerebras Systems, M. Tech(Computer Science & Automation), IISc Bengaluru
Bengaluru, India
I'm Nishit Neema, an Applied ML Researcher at Cerebras Systems, working on post-training for large language models. My current focus is long-horizon reinforcement learning — developing stable, scalable training recipes for state-of-the-art open-source models including the Qwen, GLM, Kimi, and other frontier model families. At Cerebras, my research spans inference-time orchestration (CePO — enabling a 32B model to surpass much larger frontier models on AIME and LiveCodeBench), RL-based verifier training for calibrated test-time reasoning, curriculum learning (ACER — synthesizing structured training data to develop domain expertise without catastrophic forgetting), and RL reward design (CoRPO — addressing fundamental flaws in GRPO's handling of ordinal rewards). Before Cerebras, I completed my M.Tech. in Computer Science at IISc Bengaluru, where I worked on diffusion-based generative models, resulting in MILD (Momentum-Imbued Langevin Dynamics) — a faster sampler for score-based models, published at IEEE ICASSP 2024.
Selected Publications
Journey so far..
| Sep 2025 | Paper accepted in NeurIPS |
|---|---|
| July 2024 | Joined Cerebras Systems |
| Dec 2023 | Paper accepted in IEEE ICASSP 2024 |
| May 2023 | Joined Spectrum Lab under supervision of Dr. Chandra Sekhar Seelamantula |
| July 2022 | Started M. Tech. programm in IISc in CSA Department |
| March 2022 | Achieved All India Rank 37 in GATE CS |