Heyyo! I am Sumukh, and I am a Dual-Degree student at IIIT Hyderabad, working towards bachelors [B.Tech(with Honors)] and masters [MS (by research)] degrees, majoring in Computer Science, with a focus on Natural Language Processing and Machine Learning. I expect to graduate by June 2021.
I recently finished my internship at LinkedIn with the Data Science team, and have worked at startups and research labs prior to this in roles such as Software Engineer and Research intern (data science and AI/ML).
I am a member of the Machine Translation and Natural Language Processing lab, Language Technology Research Centre (LTRC), KCIS, and advised by Prof. Manish Shrivastava. I have served as a Teaching Assistant for undergrad and graduate level courses.
Research interests:
I spend my free time by reading books that interest me, exploring new places, scrolling through reddit, or planning my next trip.
I am currently actively looking for Full Time Software Development or Machine Learning/NLP roles (June 2021 start), so if you have a position available or just want to say hi, my inbox is always open!
Flagship Data Science team - LinkedIn Engineering
Used: Python, scikit-learn, gensim, Hive/SQL
Used: Python, Docker, Django, MySQL/SQL, scikit-learn, XGBoost
Used: Python, PyTorch, scikit-learn
Used: Java [Android app development], Django, MySQL, Python, ReactJS/Javascript, HTML5, CSS
Developed a model that given an arithmetic word problem, it extracts the relevant quantities,and creates the required expression tree by predicting the operators using Deep Reinforcement learning[DQN].
View ProjectDeveloped a fully functional front-end of the compiler for a custom programming language, similar to C. Built parser, scanner, abstract syntax tree, interpreter for generating intermediate representation (LLVM IR) code for an input code file.
View ProjectCreated a search engine that uses Block-Sort-Based-Indexing to create the inverted index of the entire WikiPedia dump (73.3 GB), queries on the index and retrieves top 10 results via relevance ranking of the documents, implemented using tf-idf scoring.
View ProjectImplemented a system that takes an image and a question about the image as the input, and predicts the answer to the question. Used Bilinear Attention Networks [BAN].
View ProjectDeveloped a system to identify crowd patterns by WiFi requests sent by mobile devices and triangulate client locations with WiFi routers. This data was also used to create heat-maps and perform time series analysis using ARIMA and Prophet.
View ProjectImplemented an Application Level program for a P2P-network to keep two separate directories synced, similar to Dropbox. Used sockets to communicate; maintained file-indices, and MD5 hashes on all peers.
View ProjectImplemented Phrase Based Machine Translation Model and various Neural Machine Translation Models, including one using attention with modeling coverage, for translations between Hindi and Urdu languages. [Indian Languages]
View ProjectImplemented a command line interpreter in C which supports background jobs, environment variables, signal catching, piping and redirection with extensive error-handling.
View ProjectImplemented a small SQL engine with support for basic queries, joins and aggregate functions.
View ProjectBuilt a bot for 4x4x4 ultimate tic-tac-toe game which decides the next move on the computer generated board. Was among the Top 8 bots in a class of 120 [AI bot tournament]
View ProjectSome of the mini projects that I have worked on are: Pacman killer like game (2-D) & Legend of Zelda (3D) like game (C++, OpenGL), Tweaking the Xv6 Scheduler (C), HTTP proxy server with cache (Python), Quiz webapp (Ruby on Rails).
View Project