Rohit Mujumdar

Rohit Mujumdar

rohit dot mujumdar at yahoo dot com


I am a passionate and results-driven computer scientist with experience in machine learning, data science, and software engineering. With a Master's degree in Computer Science from Georgia Tech and extensive experience working with leading technology companies like Intel, NCR Corporation, and IBM Research, I specialize in developing innovative solutions that drive business success.

πŸ” What I Bring to the Table

🌟 Diverse Professional Experience

I've thrived in environments from a 4-employee startup in India to Intel, and from academic research at Georgia Tech to IBM Research. This diversity has honed my adaptability and ability to deliver results in varied settings.

πŸ”„ Comfort with Ambiguity

My current role has challenged me to navigate both the uncertainties of research and undefined projects. We've faced losing funding, layoff stress, changing project and business needs, and the inherent uncertainties of ML research. This experience has strengthened my ability to forge my own path, adapt to rapidly changing circumstances, and drive innovation in highly ambiguous situations.

🀝 Collaborative Spirit

While comfortable working independently, I firmly believe that when people from diverse walks of life and schools of thought team up, magic happens. I love collaborating on projects where skills complement each other. For instance, I eagerly collaborated with a sociology professor from Lilla Vicsek (University of Budapest) on developing prompts to analyze LLMs' queer value systems. My strengths in effective communication, empathy, and personal discipline make me a great team player.

πŸš€ Ownership and Proactivity

I consistently take full ownership of projects and proactively seek or create opportunities. At Intel, facing uncertain funding, I identified potential use cases for my skills outside my team. I applied my existing data science expertise to offer insightful solutions to sister teams. Simultaneously, I took the initiative to learn a completely new tech stack and skill set for GenAI applications - to deliver a RAG solution for internal document interaction, managing the project from conception to implementation.

🌍 Human-Centric Approach to Technology

I'm acutely aware of how our work in tech affects the society in nunaced, often seemingly inviisble ways, especially affecting underrepresented/historcially oppressed communities, having lived some of those experiences myself. I bring this awareness with me, this perspective being deepened through reading, my podcast interviews with diverse individuals and my role as Head TA for Georgia Tech's inaugural AI ethics course.

πŸ”“ Open Source and AI for Social Good

I support democratizing good ML and open source. Misinformation/spam/fraudulent web-traffic are some of today's digital evils, and I've witnessed their adverse effects firsthand. This drives my deep interest in combating it - I developed HawkEye, a reputation system for Twitter's Community Notes, which was the first research study on this. Our project was only possible because Twitter open-sourced their data and code. Recognizing this power, we also open-sourced HawkEye, and our ideas in-turn influenced improvements to Twitter's own Community Notes algorithm

πŸ’ƒπŸ» Personal Interests

Outside of work, I love to dance! Check out my dance videos. I am an avid practitioner of Yoga and an ardent promoter of mental health awareness.

πŸŽ₯ I run a video podcast, 'Talking To The Moon', where I interview people across professions and walks of life and listen to the stories they have to tell. More about the idea behind the vodcast in this article.

πŸ“§ Contact

Feel free to drop me an email or DM me on Twitter; I love making new friends and am always open to interesting conversations!

πŸ“š Publications
3DSP
Recognizing Similar Relationships Within Ontology to Fine Tune Ontology Neelam Chandolikar, Rishav Raj, Rohit Mujumdar
Keywords: knowledge graphs, natural language processing, semantic search engine, ontology learning, triplet extraction, education technology
ICDMAI

Ontology learning process involves identification of concepts and the relationships between these concepts. Automated ontology learning based on ML/DL techniques identifies these triples but suffers from the problem of duplicate or similar relationships. We propose a solution to identify similar relationships so that the ontology can be fine-tuned.

paper

3DSP
Overcoming Language Disparity in Online Content Classification with Multimodal Learning Gaurav Verma, Rohit Mujumdar, Zijie J. Wang, Munmun De Choudhury, Srijan Kumar
Keywords: social media, multimodal language models, language disparity, language translation
ICWSM 2022

We investigate the disparity between English and non-English language models and show that detection frameworks based on pre-trained large language models like BERT and multilingual-BERT systematically perform better on the English language. We demonstrate the promise of incorporating the information contained in images via multimodal machine learning to address this disparity.

website | paper

3DSP
HawkEye: A Robust Reputation System for Community-based Counter-Misinformation Rohit Mujumdar, Srijan Kumar
Keywords: social network analysis, misinformation, graph algorithms, adversarial attack
ASONAM 2021

We investigate the robustness of Birdwatch against adversaries and show that the current Birdwatch system is vulnerable to manipulation attacks. To overcome this vulnerability, we develop HawkEye, a cold-start-aware graph-based recursive algorithm, and show that it is more robust to such attacks.

paper | code | video

3DSP
A Heuristic Approach To Compute Service Request Resolution Time (Poster) Rohit Mujumdar, Pawan Chowdhary, Shubhi Asthana.
Keywords: time series analysis, operations research, ticket resolution, predictive model
INFORMS 2020, Best Poster Award (Honorable Mention Award)

We use statistical analyses and regression-based techniques to predict the resolution times of incident tickets. We employ techniques like dynamic rolling window, auto-regressive window-flip and artificial data creation to handle data eccentricities.

poster | video
πŸ”¬ Patents
3DSP
US20220270019A1 : Ticket-agent matching and agent skillset development Rohit Mujumdar, Shubhi Asthana, Pawan Chowdhary, Aly Megahed, Bing Zhang
Keywords: operations research, ticket resolution, employee skill management, sentiment analysis, performance review

3DSP
US20220164744A1 : Demand forecasting of service requests volume Bing Zhang, Shubhi Asthana, Pawan Chowdhary, Aly Megahed, Rohit Mujumdar, Taiga Nakamura
Keywords: operations research, ticket resolution, human-in-the-loop, time series forecast
πŸ› οΈ Selected Projects
3DSP
Exploring Fairness in Graph Embeddings Rohit Mujumdar, Sanjana Garg, Rohit Gajawada
Keywords: graph neural networks, fairness in AI, AI ethics, recommendation systems
(Web Search and Text Mining, Spring 2021. Georgia Tech)

  • Demonstrated bias in graph embeddings (generated for movie recommendation system) in existing techniques data using node2vec and metapath2vec
  • Introduced fairness mitigation methods based on Fairwalk and demonstrated recommendations with lesser bias
  • report | code

    3DSP
    Do Scientific Ideas Originating from more Prestigious Universities Spread Faster? Rohit Mujumdar, David Kartchner
    (Data Science for Epidemiology, Fall 2020. Georgia Tech)
    Keywords: epidemiology, microsoft academic graph, natural language processing, epistemology

  • Investigated the imbalance in the spread of ideas across academic research networks caused due to differences in academic prestige using disease spread models adapted from epidemiology.
  • Assessed if idea spread is driven by connectivity amongst original authors or the explicit prestige of their institution
  • website | report | code | software

    3DSP
    Can Machines Detect if you’re a Jerk? Rohit Mujumdar, Parvathy Sarat, Prathik Kaundinya, Sahith Dambekodi
    (Deep Learning, Fall 2020. Georgia Tech)
    Keywords: natural language processing, deep learning, ai ethics, reddit, computational social science

  • Used language models to assess if we can replicate the sentiments shared by Redditors and classify the Redditor's original post according to the verdict that was declared by rest of the Redditors.
  • Attempt to understand how a machine performs in a task that is entirely subjective but is possibly objective
  • report | code

    3DSP
    Conference Paper Acceptance Prediction Rohit Mujumdar, Rohan Goel, Arthita Ghosh, Shravani Sistla, Neha Pande.
    (Machine Learning, Spring 2020. Georgia Tech)
    Keywords: natural language processing, machine learning, feature engineering, peer read dataset

  • Investigated the role of peripheral features of research papers in their potential acceptability
  • Devised several innovative features such as abstract novelty and complexity, research strength score, title word-cloud etc
  • report | code | video

    3DSP
    Explainable Content Moderation Using CNNs Rohit Mujumdar, Shalini Chaudhuri, Sreehari Sreejith, Sushmita Singh
    Keywords: computer vision, image classification, content moderation, convolutional neural networks, violence detection, image flagging
    (Computer Vision, Fall 2019. Georgia Tech)

  • Built a minimal viable content moderating system to identify and flag regions of images containing violent/gory content
  • Achieved an accuracy of 0.89 by using Transfer Learning with Convolutional Neural Networks (VGG-16)
  • Captured model explainability by using Grad-CAM to generate visual explanations of the salience regions in the images
  • report | code | video

    3DSP
    Semantic Search Engine using a Dynamic Ontology Rohit Mujumdar, Poshraj Sharma, Pranjal Patil, Akanksha Patil, Dr Manasi Patwardhan.
    Keywords: natural language processing, semantic search, ontology, triplet extraction, entity-relation-entity, phrase2vec, education technology
    (Undergrad Capstone Project, 2016-17, VIT Pune)

  • Developed an e-learning platform for government schools by implementing a dynamic science ontology to store triplets (Entity-Relation-Entity) extracted from web-scraped data.
  • Devised a Phrase2Vec model driven similarity-scoring algorithm to replace similar Relations by a representative Relation.
  • report | video | code
    πŸ—‚οΈ Leadership and Volunteer Work
    πŸ“Ί In The Media

    "In the midst of winter, I found there was, within me, an invincible summer." ~ Albert Camus
    Flag Counter