About

I am a full-time Ph.D. scholar in the Department of Computer Science at IIIT-Delhi. I am fortunate to work under the supervision of Prof. Anubha Gupta. I am part of the SBILab at IIIT-Delhi. My work lies at the intersection of speech processing, Natural Language Processing (NLP), and Human-Computer Interaction (HCI). My research focuses on developing robust, cost-efficient, and inclusive speech technologies, with a particular emphasis on low-resource languages. I was recently honored with the Outstanding Paper Award at ACL 2025 (Core A*) for my paper IndicSynth, which presents a large-scale synthetic speech dataset for low-resource Indian languages. Before joining the Ph.D. program in August 2019, I worked as a Teaching Fellow in the Department of Computer Science at IIIT-Delhi from August 2018 to July 2019. With experience in both research and teaching, I am passionate about mentoring students and fostering collaborations. I am currently seeking faculty opportunities where I can contribute to cutting-edge research and impactful teaching in speech and language processing.

Research Interests:
  • Speech and Language Processing
  • Low-Resource Speech Processing
  • Cost-Efficient Speech Processing
  • Ethics, Bias, and Fairness in Speech Processing
  • Multilingual and Cross-Lingual Speech Processing
  • Generalizable and Inclusive Speech Processing

Email: divyas@iiitd.ac.in

LinkedIn | Google Scholar | ResearchGate | Semantic Scholar | Twitter

Education

  • M.Tech (CSE), IIIT-Delhi (2016-2018)

    CGPA: 8.55

    Thesis: Context-Aware RNN Based Voice Authentication System

  • B.Tech (CSE), RKGITM (2011–2015)

    Percentage: 80.68%

    Project: HiFi – Delivering Thoughts (A social networking website with speech-to-text conversion)

Skills

  • Programming Languages

    Python, C, C++, Java, C#

  • Tools & Technologies

    Visual Studio, Eclipse, NetBeans, Django, PyCharm, LaTeX, SQL Server, PostgreSQL, Wireshark, Oracle, Firebase, Proto.io

  • Courses

    Database Systems Implementation, Data Mining, Graduate Algorithms, Introduction to Spatial Computing, Object Oriented Programming and Design, Research Methods, Graph Theory, Information Retrieval, Foundations to Computer Security, Wireless Networks, Designing Human Centered Systems, Natural Language Processing, Topics in Adaptive Security, Optimization Methods for Machine Learning

  • Coursera

    Deep Learning Specialization, Natural Language Processing Specialization

Teaching

  • Teaching Fellow at IIIT-Delhi

    Advanced Programming (Aug,18-Dec,18)

    Data Structures and Algorithms (Jan,19-June,19)

    Algorithm Design and Analysis (Jan,19-Apr,19)

    Refresher module: Data Structures and Algorithms (July,19)

  • Teaching Assistant at IIIT-Delhi

    Systems Programming (Aug,16-Dec,16)

    Introduction to Media in Society (Jan,17-Apr,17)

    Cloud Computing (Aug,17-Dec,17)

    Data Structures and Algorithms (Jan,18-Apr,18)

    Refresher module on Data Structures and Algorithms for MTech ECE students (July, 2018)

    Foundations to Computer Security (Aug,19-Dec,19)

    Advanced Programming (Aug,20-Dec,20)

    Database Management Systems (Jan,21-Apr,21)

    Introduction to Programming (Dec,21-Apr,22)

    Data Science (Aug,23-Dec,23)

    Introduction to Programming (Aug,24-Dec,24)

    Mobile Computing (Jan,25-May,25)

    Software Development using Open Source (April,25-May,25)

Publications

  • 🏆 Outstanding Paper Award
    Divya Sharma, Vijval Ekbote, Anubha Gupta. 2025. IndicSynth: A Large-Scale Multilingual Synthetic Speech Dataset for Low-Resource Indian Languages. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 22037–22060, Vienna, Austria. Association for Computational Linguistics. ACL 2025 [ Core Rank A* | h5-index: 215 | Long paper] [Paper Link] [Dataset Link]
  • Divya Sharma. 2024. EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker Verification. In Findings of the Association for Computational Linguistics: NAACL 2024, pages 379–394, Mexico City, Mexico. Association for Computational Linguistics. [ Core Rank A | h5-index: 132 | Long paper] [Paper Link]
  • Divya Sharma and Arun Balaji Buduru. 2022. FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 1247–1258, Seattle, United States. Association for Computational Linguistics. [ Core Rank A | h5-index: 132 | Long paper] [Paper Link | Code]

Services

  • Research Mentorship
    • Vijval Ekbote (BTech 3rd Year Student, IIIT-Delhi): Aug, 2024 - Present
    • Sarthak Kandpal (BTech 3rd Year Student, IIIT-Delhi): Aug, 2024 - Present
    • Swati Sharma (BTech 4th Year Student, IIIT-Delhi): Jan, 2025 - Present
    • Divyasha Priyadarshini (BTech 3rd Year Student, IIIT-Delhi): Aug, 2024 - Dec, 2024
  • Reviewing Service
    • Reviewer for ACL Rolling Review
    • Received the Great Review recognition in the ARR February 2025 Cycle.
  • Served as an in-person volunteer at the ACL 2025 conference held in Vienna, Austria.

Talks

  • [28th July, 2025] Presented my paper, "IndicSynth: A Large-Scale Multilingual Synthetic Speech Dataset for Low-Resource Indian Languages", at ACL 2025
  • [13th June, 2024] Presented my paper, "EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker Verification", at NAACL 2024 [ Recorded Presentation ]
  • [15th October, 2022] Presented my work on "Investigating Cost-Effective Solutions towards Mitigating the Linguistic Bias in Speaker Verification" at the ACM India Ph.D. Students Meet-up at IIIT-Delhi
  • [13th July, 2022] Presented my paper, "FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems", at NAACL 2022 [ Recorded Presentation ]

Achievements

  • Received the Outstanding Paper Award at ACL 2025 (Core A* Conference)
  • Received the Great Review recognition in the ARR February 2025 Cycle
  • Qualified the CBSE-UGC NET exam (November 2017)
  • Secured 1st Rank in the B.Tech CSE department
  • Qualified GATE 2015
  • Microsoft Technology Associate for .Net Fundamentals

Updates

  • [30th July, 2025] Honored to receive the Outstanding Paper Award at ACL 2025 for our paper "IndicSynth: A Large-Scale Multilingual Synthetic Speech Dataset for Low-Resource Indian Languages".
  • [16th May, 2025] Thrilled to announce that our paper "IndicSynth: A Large-Scale Multilingual Synthetic Speech Dataset for Low-Resource Indian Languages" has been accepted to the Main Conference of ACL 2025. ACL is a Core A* conference in the field of computational linguistics. Authors: Divya V Sharma, Vijval Ekbote, Anubha Gupta.

  • [25th April, 2025] Honored to receive the Great Review Recognition in the ACL Rolling Review February 2025 Cycle! This recognition is given for reviews that Area Chairs deem strong, decisive, helpful, and well-written. As a reviewer, I strive to support the community by providing constructive, fair, and insightful feedback to authors. I am grateful for this recognition and look forward to continuing to support high-quality research through the review process! [Details].

  • [13th March, 2024] Delighted to share that my paper titled "EcoSpeak: Cost-Efficient Bias Mitigation for Partially Cross-Lingual Speaker Verification" has been accepted to appear in the Findings of NAACL 2024.

  • [15th October, 2022] Presented my work on "Investigating Cost-Effective Solutions towards Mitigating the Linguistic Bias in Speaker Verification" at the ACM India Ph.D. Students Meet-up at IIIT-Delhi

  • [19th August, 2022] Successfully cleared my Ph.D. comprehensive exam!!!
  • [13th July, 2022] Super happy to have given a flash talk presenting my paper at NAACL 2022!
  • [31st May, 2022] Memorable moments
  • [8th April, 2022] My paper, "FAtNet: Cost-Effective Approach Towards Mitigating the Linguistic Bias in Speaker Verification Systems", has been accepted to appear in the Findings of the Association for Computational Linguistics: NAACL 2022.