Principal Data Scientist/ Head of AI / Mentor with vast experience in building AI use-cases from scratch and deploying them to production. Amazing experience in GenAi, LLm fine-tuning , RAG, vector databases, NLP, Machine learning, Deep learning, transfer learning, working with LLM, MLOPS, deployment using aws, airflow, data bricks, pyspark, pipelining, containerization. Effective and proactive communicator with experience in leading teams and projects. Expertise in Computer Vision for OCR related information extraction from images, pdf parser, XML parser, box detection, entity
detection and recognition, Data Mining, Data tagging, Data Analysis, Feature Selection & Model Selection, Model Building, Model Validation, Model threshold validation, log analysis.
Lead AI Consultant - Generative AI
SignitivesLead Data Scientist
The Weather ChannelSenior Applied AI Scientist (Lead)
WorkFusionLive Session Expert, Mentor ( Part time)
upGradData Science Mentor ( Part time)
Great LearningAI Advisory (Part time)
LineupXHead of Data Science
Oorwin Labs Pvt LtdNLP Engineer
Theatro labsNLP Engineer
Senseforth AI Research Pvt LtdSenior Systems Engineer - Data Scientist
InfosysPython
AWS (Amazon Web Services)
ML
NLP
OCR
Deep Learning
Business Analytics
DevOps
Computer Vision
Artificial Intelligence
Data Visualization
Generative AI
Docker
Azure
AWS
Tableau
GraphQL
Flask
Gunicorn
Solr
Scrapy
GCP
Airflow
MLFlow
Haystack
LangChain
weaviate
GPU
Worked on building AI features using GPT- 3 for ATS. Resume parser, requirement parser and search engine.
Achievements:
Projects:
Analytics
Skills used: Dockerization - docker images, docker containers, docker compose, aws app services, s3, azure app services, server performance monitoring, cron jobs, Python, flask, gunicorn, Linux/Ubuntu, APACHE SOLR CLOCLOUD, MS SQL, solr faceting, solr mlt, Banana dashboard, NLP - Information Extraction, NER, Apache Solr, Solr MLT. (solr more like this), Solr Indexing, Spark, topic modelling, SQL, Python, multi processing, Machine Learning, scikit- learn/sklearn, pandas, numpy, NLP, NER, Information Extraction, CRF, Word2vec, Fastext, Doc2vec, WMD words movers distance, spacy, gensim, pattern, textblob, Neural Networks, Deep Learning, ANN, CNN, RNN, LSTM, BILSTM, BERT, box detection, Web Scraping, scrapy, scrapy cloud, crawlera, selenium, splash, spiders, docker, linux/Ubuntu.
Recent implementation: Text mining / analysis using Word2vec model, Words Movers distance algorithm, Fastext model, CRF model, LSTM - CNN models, Stanford Dependency parsers. Also implemented BIDAF (Bidirectional attention flow model) of Stanford Squad dataset
Responsibilities-
Personal Implementations:
Software Engineer- (June 2015- 2016)