
About
Hey there! If you've stumbled upon this little corner of the internet with the intention of hiring me, congratulations on your excellent life choices! Stick around till the end because, surprise, I crafted this digital masterpiece just for you. I'm Harshitha, deeply passionate about AI and Data Science, with over two years of experience spanning advanced NLP models at Millennium Management to process automation at Cisco. Holding a Master's in Computer Science from New York University, my journey has been about leveraging technology to transform complex data into actionable insights and drive business efficiency. I'm on the lookout for opportunities that challenge me to further my expertise and make impactful contributions in a dynamic environment. My rapid learning ability is a cornerstone of my career in AI and Data Science, enabling me to quickly grasp complex concepts and apply them to real-world scenarios. I'm keen to apply this capability in a role that values innovation and practical problem-solving, aiming to make a tangible impact in a forward-thinking team.
Industry Experience
I'm all about that work life. Before you judge me, hey, I'm still a newbie in the career game, so my enthusiasm hasn't had time to wear off yet! I'm convinced that practical engagement is the most effective way to learn. My interest in Machine Learning has deepened through various industry experiences. During every summer and winter break, I've actively sought opportunities to apply my learning in professional settings. This approach has not only advanced my capabilities beyond those of my colleagues in my current full-time role but has also broadened my understanding, providing a competitive advantage. I treasure each experience for the unique insights and skills I've gained. I thrive on direct involvement and hands-on work.
Millennium Management LLC, New York
Jan 2023 – Present
Data Scientist - AI/ML
Part of the Systematic Data Platform.
- Developed and deployed a Retrieval Augmented Generation framework for research document search functionality, dramatically improving user experience by providing an end-to-end solution by parsing over 35000 research documents
- Fine-tuned NLP LLMs like Llama and BERT for classification tasks to tag research documents that identify market trends based on title and analysts and with a 87.4% accuracy
- Developed and maintained predictive models using Machine Learning regression and tree-based models on time-series data for forecasting sales volume, contributing to company revenue prediction and market trend analysis on large structured and unstructured web-scraping datasets
- Led a team of 4 as the subject matter expert for the Broker Research project, conducting Unit Tests, adhering to agile methodologies and collaborating closely with Product Managers to translate requirements into operational features, ensuring alignment with roadmaps and expectations
- Presented Exploratory Data Analysis (EDA) using Big Data Analytics and Data Science Techniques on market datasets, identifying trends, and providing insights into data coverage and variable correlations. Conducted hypothesis testing to validate findings, enhancing data-driven decision-making processes
- Migrated the broker research application to a Yellowbrick database from SQL, significantly improving API response times.
Jun 2022 – Aug 2022
Data Science/AI Intern
Part of the Systematic Data Platform.
- Accomplished the automation of SLA check generation, employing Machine Learning to identify schedules, Python scripting using OOP, and TensorFlow, resulting in the creation of over 10,000 checks to ensure timely data delivery
- Migrated 10 broker research datasets to the cloud by enhancing the ETL framework with automated data pipeline testing through Python programming, Jenkins, Git, and AWS functionalities
- Analyzed datasets statistical methods and data visualization, identifying overlaps and summarizing insights to provide a comprehensive overview of data coverage
Cisco Systems, India
Feb, 2021 – July, 2021
Technical Undergraduate Inten
Hired On-campus from my Undergrad college, I interned at cisco for a period of 6 months. I was hired to assume the role of network security engineer for one of Cisco's largest client - Bank of America. Here was my transformation from only knowing the basics of Networking to being a Cisco Certified Network Associate (CCNA) and Cisco Certified DevNet Associate.
- Awarded "Intern of the Month – June 2021" for playing a pivotal role in successfully delivering a production-ready Triage project
- Implemented a Tree-based ML algorithm to achieve 99% accuracy in classifying network issues into categories identifying those fixable by automation versus manual intervention, streamlining the troubleshooting process
- Achieved a 25% reduction in downtime handling latency by automating the troubleshooting process using Python through a triage project
Palms Connect LLC, San Diego
June, 2020 – Sept, 2020
Analytics Intern
My networking with people from Linkedin brought me in touch with the CEO of the startup. Being an analyst for 4 months brought me to the realisation that there is always more to raw data than what meets the eye. With the help of data analyzation tools like Machine Learning and Tableau, I generated the best insights to the most sensitive data there is - Medical.
- Predicted survival months for cancer patients with 0.91 recall (metric chosen to evaluvate unbalanced dataset) based on Machine Learning Algorithms such as Random Forest Regressor, Gradient Boosted Trees and Artificial Neural Networks.
- Conducted comprehensive feature analysis and visualized data insights using Matplotlib, Seaborn, and Tableau for effective storytelling
National University Singapore, Singapore
Dec, 2019 – Jan 2020, 2021
Machine Learning Intern
Clearing the eligibity test, I made it to this university known for its academic and research excellence. The program was for 5 weeks and included training classes that helped us develop projects post training sessions. This intensive internship experience strengthened my Machine Learning and Deep Learning concepts.
- Excelled in a rigorous 5-week training program on Machine Learning, Deep Learning and Reinforcement Learning led by NUS faculty and Hewlett Packard Enterprise, securing a distinguished 90% grade for overall performance
- Engineered a full-fledged application using Python’s Flask framework for ongoing inference for the image classifier.
- Built and maintained a real-time inference application using Python’s Flask framework, setting up a continuous deployment pipeline that facilitated ongoing model updates and performance monitoring
Indian Institue of Technology - Madras (IIT-M)
November, 2018 - Jan, 2019
Software and Reserch Intern
- Developed a front-end in flask to that can accept a config file to run automated tests on the RISC-V platform. Tests check if all the functionalities are executed to perfection in the processor. Processor is highly configurable and tests are curated based on the coniguration of the processor
- Rewrote test codes to migrate away from the primary research platform to the fully completed platform with slightly different architecture.
Research Intern
May, 2018 - November, 2018
As a fresher to any exposure IIT-M was the first institute that was open to hiring me during my first summer vacation. I joined a team of researchers working to build a platform that can support the development of processors that cater other research purposes. Continued to offer assistance even after summer break remotely.
- Developed test codes in Reduced Instruction Set Architecture assembly language for processor functionality testing.
- Collaborated with researchers to understand the development and Worked with a team of 10 members including researchers from UC Berkeley.
Skills
Most skills that I developed for myself are out of interest. I love to code and to learn something new everyday. I am always interested in new technology and look forward to a learning opportunity.
Certifications and Publications
My "Publications and Certifications" section is something I'm genuinely proud of. It's not just a list; it represents my journey in research and learning. Although it's still evolving, it shows my goal to become a leading researcher. I believe that with the right resources and connections, I can make a meaningful impact in my field. Each item in this section is a step forward in my career, demonstrating my commitment to improving and contributing to my area of study. This part of my profile isn't just for show—it's a key part of who I am as a professional, underlining my ongoing efforts to learn, grow, and achieve my ambitions. Click on title to view my publication
Google Certified - TensorFlow Developer
This level one certificate exam tests a developers foundational knowledge of integrating machine learning into tools and applications. The certificate program requires an understanding of building TensorFlow models using Computer Vision, Convolutional Neural Networks, Natural Language Processing, and real-world image data and strategies.
Deep Learning for Automated Detection of Lung Cancer from Medical Imaging Data
This research paper presents a comprehensive study on the application of deep learning for the automated detection of lung cancer from medical imaging data, primarily focusing on chest X-rays and computed tomography (CT) scans.
Temporal Analysis of Human Serum Albumin with Recurrent Neural Networks for Changepoint Detection and Prediction
This paper’s objective is two-fold: (i) Predicting the direction of the protein movement in terms of these distances and (ii) temporal analysis of these distances to detect changepoints and identify intervals of high-binding affinity. Prediction of the direction is achieved using a recurrent neural network (RNN).
Ethical Considerations in AI-assisted Decision-Making for End-Of-Life Care in Healthcare
This paper delves into the ethical implications of deploying artificial intelligence (AI) in decision-making processes related to end-of-life care within healthcare settings.
Security Challenges and Solutions in AI-Enhanced Cloud Platforms: A Comprehensive Study
This comprehensive study delves into the security intricacies arising from the amalgamation of AI and cloud platforms, aiming to identify, analyze, and propose effective strategies to address these challenges.
Improved Scheme for Cluster Based Fault Tolerant Data Aggregation in Wireless Sensor Networks
Paper describes the use of TCP protocol within clusters to minize energy and maximised network communication through the shortest routes
Green IoT (G-IoT): an Insight on green computing for greening the future
In the era of computer how to responsiblly utilize the internet of Things for a mindful future and greener Future
Testimonials
Contact
Please reach me at:
Location:
New York, NY - 10029
Email:
harshithard05@gmail.com
Call:
+1 6465787799