rohit dot mujumdar at yahoo dot com
I am a passionate and results-driven computer scientist with experience in machine learning, data science, and software engineering. With a Master's degree in Computer Science from Georgia Tech and extensive experience working with leading technology companies like Intel, NCR Corporation, and IBM Research, I specialize in developing innovative solutions that drive business success.
What I Bring to the Table
Diverse Professional Experience
I've thrived in environments from a 4-employee startup in India to Intel, and from academic research at Georgia Tech to IBM Research. This diversity has honed my adaptability and ability to deliver results in varied settings.
Comfort with Ambiguity
My current role has challenged me to navigate both the uncertainties of research and undefined projects. We've faced losing funding, layoff stress, changing project and business needs, and the inherent uncertainties of ML research. This experience has strengthened my ability to forge my own path, adapt to rapidly changing circumstances, and drive innovation in highly ambiguous situations.
Collaborative Spirit
While comfortable working independently, I firmly believe that when people from diverse walks of life and schools of thought team up, magic happens. I love collaborating on projects where skills complement each other. For instance, I eagerly collaborated with Lilla Vicsek, a sociology professor (University of Budapest), on developing prompts to analyze LLMs' queer value systems. My strengths in effective communication, empathy, and personal discipline make me a great team player.
Ownership and Proactivity
I consistently take full ownership of projects and proactively seek or create opportunities. At Intel, facing uncertain funding, I identified potential use cases for my skills outside my team and applied my existing data science expertise to offer insightful solutions to sister teams. Simultaneously, I took the initiative to learn a completely new tech stack and skill set for GenAI applications, and delivered a RAG solution for internal document interaction, managing the project from conception to implementation.
Human-Centric Approach to Technology
I'm acutely aware of how our work in tech affects society in nuanced, often seemingly invisible ways, especially for underrepresented and historically oppressed communities, having lived some of those experiences myself. I bring this awareness to my work; it has been deepened through reading, my podcast interviews with diverse individuals, and my role as Head TA for Georgia Tech's inaugural AI ethics course.
Open Source and AI for Social Good
I support democratizing good ML and open source. Misinformation, spam, and fraudulent web traffic are some of today's digital evils, and I've witnessed their adverse effects firsthand. This drives my deep interest in combating them: I developed HawkEye, a reputation system for Twitter's Community Notes, in what was the first research study on Community Notes. Our project was only possible because Twitter open-sourced their data and code. Recognizing this power, we also open-sourced HawkEye, and our ideas in turn influenced improvements to Twitter's own Community Notes algorithm.
Personal Interests
Outside of work, I love to dance! Check out my dance videos. I am an avid practitioner of Yoga and an ardent promoter of mental health awareness.
I run a video podcast, 'Talking To The Moon', where I interview people across professions and walks of life and listen to the stories they have to tell. More about the idea behind the vodcast in this article.
Keywords: knowledge graphs, natural language processing, semantic search engine, ontology learning, triplet extraction, education technology
ICDMAI
The ontology learning process involves identifying concepts and the relationships between them. Automated ontology learning based on ML/DL techniques identifies these concept-relationship triples but suffers from the problem of duplicate or similar relationships. We propose a solution to identify similar relationships so that the ontology can be fine-tuned.
paper
Keywords: social media, multimodal language models, language disparity, language translation
ICWSM 2022
We investigate the disparity between English and non-English language models and show that detection frameworks based on pre-trained large language models like BERT and multilingual-BERT systematically perform better on the English language. We demonstrate the promise of incorporating the information contained in images via multimodal machine learning to address this disparity.
website | paper
Keywords: social network analysis, misinformation, graph algorithms, adversarial attack
ASONAM 2021
We investigate the robustness of Birdwatch against adversaries and show that the current Birdwatch system is vulnerable to manipulation attacks. To overcome this vulnerability, we develop HawkEye, a cold-start-aware graph-based recursive algorithm, and show that it is more robust to such attacks.
paper | code | video
Keywords: time series analysis, operations research, ticket resolution, predictive model
INFORMS 2020, Best Poster Award (Honorable Mention Award)
We use statistical analyses and regression-based techniques to predict the resolution times of incident tickets. We employ techniques like dynamic rolling window, auto-regressive window-flip and artificial data creation to handle data eccentricities.
poster | video
Keywords: operations research, ticket resolution, employee skill management, sentiment analysis, performance review
Keywords: operations research, ticket resolution, human-in-the-loop, time series forecast
Keywords: graph neural networks, fairness in AI, AI ethics, recommendation systems
(Web Search and Text Mining, Spring 2021. Georgia Tech)
(Data Science for Epidemiology, Fall 2020. Georgia Tech)
Keywords: epidemiology, microsoft academic graph, natural language processing, epistemology
(Deep Learning, Fall 2020. Georgia Tech)
Keywords: natural language processing, deep learning, ai ethics, reddit, computational social science
(Machine Learning, Spring 2020. Georgia Tech)
Keywords: natural language processing, machine learning, feature engineering, peer read dataset
Keywords: computer vision, image classification, content moderation, convolutional neural networks, violence detection, image flagging
(Computer Vision, Fall 2019. Georgia Tech)
Keywords: natural language processing, semantic search, ontology, triplet extraction, entity-relation-entity, phrase2vec, education technology
(Undergrad Capstone Project, 2016-17, VIT Pune)
- Reviewer – CVPR 2024, NAACL 2024, SWPC 2024 (Intel), July 2022 - March 2023
- Vice President of Public Relations – iNCRedible Toastmasters, NCR Corporation, July 2022 - March 2023
- University Recruiting Champion – NCR Corporation Recruiting, Sept 2021 - Sept 2022
- Head Teaching Assistant – AI, Ethics and Society, Georgia Tech, Fall 2020, by Dr. Ayanna Howard
- Head Teaching Assistant – AI, Ethics and Society, Georgia Tech, Spring 2020, by Dr. Ayanna Howard
- Teaching Assistant – Knowledge-Based AI, Georgia Tech, Fall 2019, by Dr. Ashok Goel
- Editor-in-Chief – Pi Editorial Board, VIT Pune, Oct 2015 to Feb 2017
- Director of Communications – TEDxVITPune, Oct 2015 to May 2016
- March 2024 – Appeared on the IEEE Pune podcast 'Beyond conversations'
- Since Nov 2022 – My work on Birdwatch has been covered on various prominent platforms, including:
  - Change in Twitter's Demographics (The Asian Age)
  - Fighting hate speech and misinformation online, Nature Magazine
  - Fixing fake news and misinformation online using Robust AI models, The AI Podcast by Jay Shah
  - Advances in AI for web integrity, equity, and well-being (PubMed)
  - Print/digital newspapers like: South Asian Times, American Kahani, American Bazaar Online
- Aug 2020 – Representing IBM Research at PyBay 2020: IBM Data Science Community News
- April 2020 – TA of the Month, Georgia Tech, OMSCS: TA Spotlight
- April 2020 – DI Lab's Team Emprize enters XPRIZE semi-finals: Meet AI XPRIZE Semifinalist emPrize
- Jan 2020 – Design and Intelligence Lab Researcher Feature: Rohit Mujumdar (Student Researcher)
- May 2013 – In conversation with Indian Express: Candidates find MT-CET papers easy