Harshitha R D

Shaping the Future with AI/ML: Transforming Insights into Impact



About

Hey there! If you've stumbled upon this little corner of the internet with the intention of hiring me, congratulations on your excellent life choices! Stick around till the end because, surprise, I crafted this digital masterpiece just for you.

I'm Harshitha, deeply passionate about AI and Data Science, with over two years of experience spanning advanced NLP models at Millennium Management to process automation at Cisco. Holding a Master's in Computer Science from New York University, my journey has been about leveraging technology to transform complex data into actionable insights and drive business efficiency. I'm on the lookout for opportunities that challenge me to further my expertise and make impactful contributions in a dynamic environment.

My rapid learning ability is a cornerstone of my career in AI and Data Science, enabling me to quickly grasp complex concepts and apply them to real-world scenarios. I'm keen to apply this capability in a role that values innovation and practical problem-solving, aiming to make a tangible impact in a forward-thinking team.

Industry Experience

I'm all about that work life. Before you judge me, hey, I'm still a newbie in the career game, so my enthusiasm hasn't had time to wear off yet!

I'm convinced that practical engagement is the most effective way to learn. My interest in Machine Learning has deepened through various industry experiences. During every summer and winter break, I've actively sought opportunities to apply my learning in professional settings. This approach has not only advanced my capabilities beyond those of my colleagues in my current full-time role but has also broadened my understanding, providing a competitive advantage. I treasure each experience for the unique insights and skills I've gained. I thrive on direct involvement and hands-on work.

Millennium Management LLC, New York

Jan 2023 – Present

Data Scientist

Part of the Systematic Data Platform.

  • Developed and deployed a Retrieval-Augmented Generation framework for research document search, providing an end-to-end solution that parses over 35,000 research documents and dramatically improves the user experience
  • Fine-tuned language models such as Llama and BERT for classification tasks that tag research documents identifying market trends, based on title and analysts, with 87.4% accuracy
  • Researched and experimented with Machine Learning and statistical models such as tree-based models and XGBoost, evaluated them against different metrics to measure model performance, and conducted A/B testing to analyze high-dimensional datasets
  • Subject Matter Expert for the Broker Research project, leading features from conception to completion, collaborating with product leaders to understand business problems and translating requirements into appropriate features
  • Orchestrated the integration and deployment of an Application Programming Interface (API) connecting SQL Server and AWS cloud, utilizing Python's FastAPI to deliver real-time data with response times under 0.5 seconds
  • Migrated the broker research application to a Yellowbrick database from SQL, significantly improving API response times

Jun 2022 – Aug 2022

Data Science Intern

Part of the Systematic Data Platform.

  • Automated SLA check generation using Machine Learning to identify schedules, object-oriented Python scripting, and TensorFlow, creating over 10,000 checks to ensure timely data delivery
  • Migrated 10 broker research datasets to the cloud by enhancing the ETL framework with automated data pipeline testing through Python programming, Jenkins, Git, and AWS functionalities
  • Analyzed datasets using statistical methods and data visualization, identifying overlaps and summarizing insights to provide a comprehensive overview of data coverage

Cisco Systems, India

Feb, 2021 – July, 2021

Technical Undergraduate Software Intern

Hired on campus from my undergraduate college, I interned at Cisco for six months as a network security engineer for one of Cisco's largest clients, Bank of America. Over that time I went from knowing only the basics of networking to becoming a Cisco Certified Network Associate (CCNA) and Cisco Certified DevNet Associate.

  • Awarded "Intern of the Month – June 2021" for playing a pivotal role in successfully delivering a production-ready Triage project
  • Implemented a tree-based ML algorithm that achieved 99% accuracy in classifying network issues as fixable by automation or requiring manual intervention, streamlining the troubleshooting process
  • Achieved a 25% reduction in downtime handling latency by automating the troubleshooting process using Python through a triage project

Palms Connect LLC, San Diego

June, 2020 – Sept, 2020

Analytics Intern

Networking on LinkedIn put me in touch with the CEO of the startup. Four months as an analyst taught me that there is always more to raw data than meets the eye. Using data analysis tools such as Machine Learning and Tableau, I generated insights from the most sensitive data there is: medical data.

  • Predicted survival months for cancer patients with 0.91 recall (the metric chosen to evaluate the unbalanced dataset) using Machine Learning algorithms such as Random Forest Regressor, Gradient Boosted Trees and Artificial Neural Networks
  • Conducted comprehensive feature analysis and visualized data insights using Matplotlib, Seaborn, and Tableau for effective storytelling

National University of Singapore, Singapore

Dec, 2019 – Jan, 2020

Deep Learning Analytics Intern

After clearing the eligibility test, I made it to this university known for its academic and research excellence. The five-week program included training classes, after which we developed projects of our own. This intensive internship strengthened my grasp of Machine Learning and Deep Learning concepts.

  • Excelled in a rigorous 5-week training program on Machine Learning, Deep Learning and Reinforcement Learning led by NUS faculty and Hewlett Packard Enterprise, securing a distinguished 90% grade for overall performance
  • Built and maintained a full-fledged, real-time inference application for the image classifier using Python’s Flask framework, setting up a continuous deployment pipeline that facilitated ongoing model updates and performance monitoring

Indian Institute of Technology - Madras (IIT-M)

November, 2018 - Jan, 2019

Software and Research Intern

  • Developed a front-end in Flask that accepts a config file to run automated tests on the RISC-V platform. The tests check that every function of the processor executes correctly. The processor is highly configurable, and the tests are curated based on its configuration
  • Rewrote test codes to migrate away from the primary research platform to the fully completed platform, which has a slightly different architecture

May, 2018 - November, 2018

Developer Intern

I was a fresher with no prior exposure, and IIT-M was the first institute open to hiring me during my first summer vacation. I joined a team of researchers building a platform to support the development of processors that cater to other research purposes. I continued to offer assistance remotely even after the summer break.

  • Developed test codes in Reduced Instruction Set Architecture assembly language for processor functionality testing.
  • Collaborated with researchers to understand the development work, and worked with a team of 10 members, including researchers from UC Berkeley

Skills

Most of the skills I have developed come from personal interest. I love to code and to learn something new every day. I am always interested in new technology and look forward to every learning opportunity.

Python 100%
TensorFlow 100%
Machine Learning 100%
C++ 100%
Hadoop 100%
PySpark 100%
AWS 100%
MySQL 100%
R 90%
PyTorch 100%
Generative AI 80%
Natural Language Processing 100%
Bash 100%
FastAPI 100%
Tableau 90%
Git 100%
Jenkins 80%
Linux 100%

Certifications and Publications

My "Certifications and Publications" section is something I'm genuinely proud of. It's not just a list; it represents my journey in research and learning. Although it's still evolving, it shows my goal to become a leading researcher. I believe that with the right resources and connections, I can make a meaningful impact in my field. Each item in this section is a step forward in my career, demonstrating my commitment to improving and contributing to my area of study. This part of my profile isn't just for show; it's a key part of who I am as a professional, underlining my ongoing efforts to learn, grow, and achieve my ambitions.

Click on a title to view the publication

Google Certified - TensorFlow Developer

This level one certificate exam tests a developer's foundational knowledge of integrating machine learning into tools and applications. The certificate program requires an understanding of building TensorFlow models using Computer Vision, Convolutional Neural Networks, Natural Language Processing, and real-world image data and strategies.

Deep Learning for Automated Detection of Lung Cancer from Medical Imaging Data

This research paper presents a comprehensive study on the application of deep learning for the automated detection of lung cancer from medical imaging data, primarily focusing on chest X-rays and computed tomography (CT) scans.

Temporal Analysis of Human Serum Albumin with Recurrent Neural Networks for Changepoint Detection and Prediction

This paper’s objective is two-fold: (i) Predicting the direction of the protein movement in terms of these distances and (ii) temporal analysis of these distances to detect changepoints and identify intervals of high-binding affinity. Prediction of the direction is achieved using a recurrent neural network (RNN).

Ethical Considerations in AI-assisted Decision-Making for End-Of-Life Care in Healthcare

This paper delves into the ethical implications of deploying artificial intelligence (AI) in decision-making processes related to end-of-life care within healthcare settings.

Security Challenges and Solutions in AI-Enhanced Cloud Platforms: A Comprehensive Study

This comprehensive study delves into the security intricacies arising from the amalgamation of AI and cloud platforms, aiming to identify, analyze, and propose effective strategies to address these challenges.

Improved Scheme for Cluster Based Fault Tolerant Data Aggregation in Wireless Sensor Networks

This paper describes the use of the TCP protocol within clusters to minimize energy use and maximize network communication through the shortest routes.

Green IoT (G-IoT): an Insight on green computing for greening the future

This work looks at how, in the era of computing, to responsibly use the Internet of Things for a mindful and greener future.

Projects

All the different projects that I have worked on. Click for details

Testimonials

The keen observational abilities that Harshitha possesses have always made the classes very entertaining and interactive. Harshitha has proven herself to carry the perseverance, motivation, and intellectual ability necessary to perform at any given role.

Sudha N

Professor, SASTRA Deemed to be University

In her third and fourth semester, Harshitha wrote a couple of book chapters on green computing and AI for agriculture for Springer and has co-authored a paper in networking with me. Brainstorming sessions with her can be an eye-opener to new perspectives and ideas.

Suriya Prabha

Associate Professor, SASTRA Deemed to be University.

Displaying a keen interest in any task assigned to her, Harshitha always performed her duties to perfection. Her most remarkable talent lies in her problem-solving ability. Possessing good programming skills, she could solve a seemingly complicated problem with a perfect set of lines of code. She could be exposed to an alien environment and still perform exceedingly well.

Lavanya Jagan

Project Manager, IIT Madras

Contact

Please reach me at:

Location:

New York, NY - 10029

Call:

+1 6465787799
