Heyyo! I am Sumukh, and I am a Dual-Degree student at IIIT Hyderabad, working towards bachelors [B.Tech(with Honors)] and masters [MS (by research)] degrees, majoring in Computer Science, with a focus on Natural Language Processing and Machine Learning. I expect to graduate by June 2021.
I recently finished my internship at LinkedIn with the Data Science team, and have worked at startups and research labs prior to this in roles such as Software Engineer and Research intern (data science and AI/ML).
I am a member of the Machine Translation and Natural Language Processing lab, Language Technology Research Centre (LTRC), KCIS, and advised by Prof. Manish Shrivastava. I have served as a Teaching Assistant for undergrad and graduate level courses.
Research interests:
I spend my free time by reading books that interest me, exploring new places, scrolling through reddit, or planning my next trip.
I am currently actively looking for Full Time Software Development or Machine Learning/NLP roles (June 2021 start), so if you have a position available or just want to say hi, my inbox is always open!
Flagship Data Science team - LinkedIn Engineering
Used: Python, scikit-learn, gensim, Hive/SQL
Used: Python, Docker, Django, MySQL/SQL, scikit-learn, XGBoost
Used: Python, PyTorch, scikit-learn
Used: Java [Android app development], Django, MySQL, Python, ReactJS/Javascript, HTML5, CSS
Developed a model that given an arithmetic word problem, it extracts the relevant quantities,and creates the required expression tree by predicting the operators using Deep Reinforcement learning[DQN].
View Project
Developed a fully functional front-end of the compiler for a custom programming language, similar to C. Built parser, scanner, abstract syntax tree, interpreter for generating intermediate representation (LLVM IR) code for an input code file.
View Project
Created a search engine that uses Block-Sort-Based-Indexing to create the inverted index of the entire WikiPedia dump (73.3 GB), queries on the index and retrieves top 10 results via relevance ranking of the documents, implemented using tf-idf scoring.
View Project
Implemented a system that takes an image and a question about the image as the input, and predicts the answer to the question. Used Bilinear Attention Networks [BAN].
View Project
Developed a system to identify crowd patterns by WiFi requests sent by mobile devices and triangulate client locations with WiFi routers. This data was also used to create heat-maps and perform time series analysis using ARIMA and Prophet.
View Project
Implemented an Application Level program for a P2P-network to keep two separate directories synced, similar to Dropbox. Used sockets to communicate; maintained file-indices, and MD5 hashes on all peers.
View Project
Implemented Phrase Based Machine Translation Model and various Neural Machine Translation Models, including one using attention with modeling coverage, for translations between Hindi and Urdu languages. [Indian Languages]
View Project
Implemented a command line interpreter in C which supports background jobs, environment variables, signal catching, piping and redirection with extensive error-handling.
View Project
Implemented a small SQL engine with support for basic queries, joins and aggregate functions.
View Project
Built a bot for 4x4x4 ultimate tic-tac-toe game which decides the next move on the computer generated board. Was among the Top 8 bots in a class of 120 [AI bot tournament]
View Project
Some of the mini projects that I have worked on are: Pacman killer like game (2-D) & Legend of Zelda (3D) like game (C++, OpenGL), Tweaking the Xv6 Scheduler (C), HTTP proxy server with cache (Python), Quiz webapp (Ruby on Rails).
View Project