Arindam Paul


Professional Summary

Computer Scientist working in Data Mining with 5+ years of research experience, seeking full-time opportunities starting July 2019.

Interested in Data Mining, Deep Learning, Natural Language Processing, Artificial Intelligence

Computer Skills

  • Proficient: Python, Keras, scikit-learn, TensorFlow, Selenium, PySpark
  • Familiar: R, MATLAB, C, C++, Java, PHP, LAMP, SQL, Weka, Gephi, JavaScript, HTML, CSS, Hadoop, Mahout, MPI

Education

Northwestern University, Evanston, Illinois

Birla Institute of Technology & Science, Pilani, Rajasthan India
  • Bachelor of Engineering (Graduated with Honors) in Chemical Engineering, 2010
  • Thesis: Detecting Sybil Attacks in P2P networks using Psychometric Analysis
  • Advisor: Prof. K. Haribabu

Professional Experience (last 5 years)

Northwestern Mutual: Jun-Aug 2018
  • Developed distributed image-to-text conversion algorithms for detecting responses from scanned questionnaires
  • Developed a noise reduction algorithm to denoise scanned and photocopied questionnaires

EDT: Consultant, Jun 2017-Jan 2018
  • Provided subject matter expertise to develop algorithms for topic mining on legal documents
  • Assisted in designing models for profanity detection in a company-wide email database

Boeing Cybersecurity (Narus Inc.): Jun-Sep 2013
  • Generated synthetic user profiles with different demographic and interest features
  • Developed a machine learning model for predicting user demographics and interests from ads

Fellowships

  • Walter P. Murphy Fellowship, during 1st year of PhD
  • Segal Design Cluster Fellowship, during 3rd year of PhD
  • Predictive Science & Engineering Design Cluster Fellowship, during 5th year of PhD

Publications

A. Paul, D. Jha, R. Al-Bahrani, W. Liao, A. Choudhary and A. Agrawal. CheMixNet: Mixed DNN Architectures for Predicting Chemical Properties using Multiple Molecular Representations. 2018 Conference on Neural Information Processing Systems (NIPS)

D. Jha, L. Ward, A. Paul, W. Liao, A. Agrawal, A. Choudhary and C. Wolverton. ElemNet: Deep Learning the Chemistry of Materials From Only Elemental Composition. Nature Scientific Reports, 2018

A. Paul, P. Acar, W. Liao, A. Choudhary, V. Sundararaghavan and A. Agrawal. Microstructure Optimization with Constrained Design Objectives using Machine Learning-Based Feedback-Aware Data-Generation. Journal of Computational Materials Science, 2018 (in review)

M. Mozaffar, A. Paul, R. Al-Bahrani, S. Wolff, A. Choudhary, A. Agrawal, K. Ehmann and J. Cao. Data-Driven Prediction of the High-Dimensional Thermal History in Directed Energy Deposition Processes via Recurrent Neural Networks. Manufacturing Letters, 2018

A. Paul, P. Acar, R. Liu, W. Liao, A. Choudhary, V. Sundararaghavan and A. Agrawal. Data Sampling Schemes for Microstructure Design with Vibrational Tuning Constraints. Journal of American Institute of Aeronautics and Astronautics, 2018

A. Paul, A. Agrawal, W. Liao and A. Choudhary. AnonyMine: Mining anonymous social media posts using psycho-lingual and crowd-sourced dictionaries. Workshop on Sentiment Discovery and Opinion Mining at 22nd ACM Conference on Knowledge Discovery and Data Mining, 2016

J. Birnholtz, N.A.R. Merola, and A. Paul. "Is it Weird to Still Be a Virgin?": Anonymous, Locally Targeted Questions on Facebook Confession Boards. ACM Conference on Human Factors in Computing Systems, 2015

A. Paul, Varuni G., J.S. Challa and Y. Sharma. HADCLEAN: A Hybrid Approach for Data Cleaning Techniques in Data Warehouses. IEEE International Conference on Information Retrieval and Knowledge Management (CAMP), 2012

K. Haribabu, C. Hota and A. Paul. GAUR: A Method to Detect Sybil Groups in Peer-to-Peer Overlays. International Journal of Grid and Utility Computing, 2012, Vol. 3

A. Paul, J.S. Challa, Y. Dada, V. Nerella, P.R. Srivastava and A.P. Singh. Integrated Software Quality Evaluation: A Fuzzy Multi-Criteria Approach. Journal of Information Processing Systems (JIPS), 2011, Vol. 7

A. Paul, J.S. Challa, Y. Dada, V. Nerella and P.R. Srivastava. Quantification of Software Quality Parameters using Fuzzy Multi-Criteria Approach. IEEE International Conference on Process Automation Control and Computing (PACC), 2011

A. Paul, K. Haribabu and C. Hota. Detecting Sybils in Peer-to-Peer Overlays using Psychometric Analysis Methods. IEEE International Conference on Advanced Information Networking and Applications (AINA), 2011

Leadership

  • President (2014-16), Northwestern University Cricket Club
  • President (2016-) and Treasurer (2015-16), Northwestern SpeakEasy Toastmasters Club
  • Facilitator (2016-17), Northwestern Multicultural Dialogue Group
  • Student Coordinator (2016-17), Northwestern Predictive Science & Engineering Design
  • STEM Liaison (2014-2015), Northwestern Ethnic Students Group

Teaching and Outreach

  • Instructor, Transferable Skills Workshop on Machine Learning
  • Lecturer (and TA) for Introduction to Programming, Northwestern, Winter 2015
  • Lecturer (and TA) for Introduction to Programming, Northwestern, Winter and Spring 2014
  • TA for Data Structures, Northwestern, Fall 2015
  • Guest Lecturer for Social Media Mining, Northwestern, Spring 2016
  • TA for Database Systems, BITS Pilani, Spring 2012
  • Instructor and Mentor, Brave Initiatives

Awards & Achievements

  • One of 10 doctoral students across Northwestern selected for the summer-long Research Communication Seminar Course, 2016
  • All India Rank 1 in BITS HDSAT (admission test for graduate programs at BITS Pilani)
  • All India Rank 64 & State Rank 9 in National Science Olympiad among more than half a million participants during freshman year of high school
  • Recipient of BITS Pilani Merit-cum-Need Scholarship during the last 3 years of undergraduate study


Selected Course Projects

  • Developed a sentiment analysis tool that uses Natural Language Processing techniques to find the most interesting or controversial events at the 2013 Golden Globe Awards from user tweets (Python)
  • Developed Sudoku and Othello solvers using constraint satisfaction and minimax algorithms with efficient tree-based data structures (C++)
  • Implemented a fully distributed event detection mechanism by utilizing a Kademlia-based DHT overlay network (Go)
  • Developed a web application to track a user’s stock portfolio. Used data mining techniques to analyze and predict stock and portfolio performance from historical data (Perl, SQL)

Selected Side Projects

  • Developed a shopping recommendation system using Elasticsearch (Lucene), based on users’ “I just bought” Amazon tweets (Python)
  • Developed a real-time tool that starts an alarm when a designated bus is ‘x’ (customizable) minutes away from the closest bus stop, by scraping the CTA bus tracker webpage (Python)
  • Developed a web-automation and scraping tool that collects past news articles from the web. Used OCR to extract the text from old articles (Python)

Research Mentorship

  • Debanjan Borthakur, Graduate Student, University of Rhode Island
  • Sean Chan, Undergraduate Student, Northwestern
  • Tirtha Sarathi Ghosh, Undergraduate Student, Jadavpur University, India
  • Indervir Singh, Undergraduate Student, BITS Pilani

Professional Service

  • Student volunteer at ACM CSCW 2014 Program Committee
  • Peer Reviewer for ICDM 2016 Conference and Workshops
  • Peer Reviewer for InfoComm 2013
  • Peer Reviewer for AIAA 2016

References (on request)

  • Dr. Alok Choudhary - Professor of EECS (Northwestern University)
  • Dr. Doug Downey - Associate Professor of EECS (Northwestern University)
  • Dr. Oliver Cossairt - Assistant Professor of EECS (Northwestern University)
  • Dr. K. Haribabu - Assistant Professor of CSIS (BITS Pilani)