Research

Academic Publications

TEEMIL: Towards Educational MCQ Difficulty Estimation in Indic Languages

The 31st International Conference on Computational Linguistics (COLING 2025)

Ravikiran, M., Vohra, S., Verma, R., Saluja, R., & Bhavsar, A. (2025). TEEMIL: Towards Educational MCQ Difficulty Estimation in Indic Languages. The 31st International Conference on Computational Linguistics (COLING 2025). ACL Anthology.

You Reap What You Sow—Revisiting Intra-class Variations and Seed Selection in Temporal Ensembling for Image Classification

International Conference on Frontiers in Computing and Systems (COMSYS 2023)

Ravikiran, M., Vohra, S., Nonaka, Y., Kumar, S., Sen, S., Mariyasagayam, N., & Banerjee, K. (2023). You Reap What You Sow—Revisiting Intra-class Variations and Seed Selection in Temporal Ensembling for Image Classification. In Proceedings of International Conference on Frontiers in Computing and Systems (pp. 73-82). Springer, Singapore.

(Manikandan Ravikiran and Siddharth Vohra—Both authors contributed equally. Names are ordered alphabetically)

Investigating the Effect of Intraclass Variability in Temporal Ensembling

arXiv preprint (2020)

Vohra, S., & Ravikiran, M. (2020, August 21). Investigating the effect of intraclass variability in temporal ensembling. arXiv.org

Technical Blogs

Using WAF with App Runner in Copilot

February 23, 2023

Vohra, S. (2023, February 23). Using WAF with app runner in copilot. AWS Copilot CLI Blog

Research Experience

Independent Research

June 2020 - Present

Conducted independent research with mentors from Hitachi R&D and Thoughtworks, spanning traditional machine learning, large language models (LLMs), and computer vision
Co‑authored a COLING 2025 paper introducing a multiple‑choice question benchmark for Hindi and Kannada educational texts and providing LLM‑based baseline difficulty‑estimation models.
Co‑authored a AIED 2025 submission investigating whether semantic similarity between answer options can predict question difficulty
Benchmarked Temporal Ensembling semi‑supervised models across diverse vision datasets, analysing intra‑class variability impacts in collaboration with Hitachi R&D; findings published at COMSYS 2023
Initiated and led a project to benchmark coding‑question difficulty using LLMs, developing a BERT‑style model for automatic programming‑problem complexity estimation.
Explored computer‑vision techniques for pedestrian‑attribute recognition and person re‑identification, evaluating multiple network architectures to improve identification accuracy in security applications.
Built reproducible pipelines to fine‑tune GPT, Claude, Llama and Gemma models with LoRA/QLoRA, enabling single‑GPU experimentation.