profile-pic

Sai Vignan Malyala

Sai Vignan Malyala

Principal Data Scientist/ Head of AI / Mentor with vast experience in building AI use-cases from scratch and deploying them to production. Amazing experience in GenAi, LLm fine-tuning , RAG, vector databases, NLP, Machine learning, Deep learning, transfer learning, working with LLM, MLOPS, deployment using aws, airflow, data bricks, pyspark, pipelining, containerization. Effective and proactive communicator with experience in leading teams and projects. Expertise in Computer Vision for OCR related information extraction from images, pdf parser, XML parser, box detection, entity

detection and recognition, Data Mining, Data tagging, Data Analysis, Feature Selection & Model Selection, Model Building, Model Validation, Model threshold validation, log analysis.

  • Role

    Head of AI

  • Years of Experience

    9 years

  • Professional Portfolio

    View here

Skillsets

  • LLMs - 9 Years
  • Automated Testing - 2 Years
  • CI/CD - 3 Years
  • Data Structure - 1 Years
  • MLFlow - 2 Years
  • Natural Language Processing - 9 Years
  • Sagemaker - 4 Years
  • TensorFlow - 2 Years
  • Jenkins - 2 Years
  • SQL - 6 Years
  • Docker - 9 Years
  • Kubernetes - 2 Years
  • Problem Solving - 8 Years
  • Classification - 8 Years
  • Cnn - 5 Years
  • AWS - 6 Years
  • ML libraries - 2 Years
  • Regression - 7 Years
  • Rnn - 2 Years
  • Scikit-learn - 8 Years
  • Supervised ml - 8 Years
  • NLP/LLM - 9 Years
  • Third party api - 9 Years
  • GCP - 1 Years
  • JavaScript - 1 Years
  • PostgreSQL - 1 Years
  • GPT-4 - 3 Years
  • RESTful API - 8 Years
  • Teamwork - 8 Years
  • Deep Learning - 7 Years
  • Machine Learning - 8 Years
  • NLP - 8 Years
  • Data Science
  • Leadership - 4.5 Years
  • Statistics - 7 Years
  • Data Modelling - 7 Years
  • Cloud DevOps - 4 Years
  • Product Management - 2 Years
  • Pytorch - 6 Years
  • OCR
  • Computer Vision
  • Keras
  • DevOps
  • Python - 9 Years
  • Data Modeling
  • Data Visualization
  • Deployment
  • Business analytics
  • Transfer learning
  • Generative AI space - 3 Years
  • Large Language Model (LLM) - 4 Years
  • Vector Database - 4 Years
  • langchain.js - 1.5 Years
  • Prompt engineering - 2 Years
  • Vector databases - 4 Years
  • TypeScript - 1 Years
  • Nodejs - 1 Years

Professional Summary

9Years
  • Mar, 2023 - Jun, 20241 yr 3 months

    Lead Data Scientist

    The Weather Channel
  • Mar, 2023 - Dec, 2023 9 months

    Lead AI Consultant - Generative AI

    Signitives
  • Aug, 2022 - Feb, 2023 6 months

    Senior Applied AI Scientist (Lead)

    WorkFusion
  • Jun, 2021 - Aug, 20221 yr 2 months

    Data Science Mentor ( Part time)

    Great Learning
  • Oct, 2018 - Aug, 20223 yr 10 months

    Head of Data Science

    Oorwin Labs Pvt Ltd
  • Aug, 2022 - Feb, 2023 6 months

    AI Advisory (Part time)

    LineupX
  • Jul, 2019 - Aug, 20223 yr 1 month

    Live Session Expert, Mentor ( Part time)

    upGrad
  • Jun, 2018 - Oct, 2018 4 months

    NLP Engineer

    Theatro labs
  • Feb, 2018 - Jun, 2018 4 months

    NLP Engineer

    Senseforth AI Research Pvt Ltd
  • Jun, 2015 - Jan, 20182 yr 7 months

    Senior Systems Engineer - Data Scientist

    Infosys

Applications & Tools Known

  • icon-tool

    Python

  • icon-tool

    AWS (Amazon Web Services)

  • icon-tool

    ML

  • icon-tool

    NLP

  • icon-tool

    OCR

  • icon-tool

    Deep Learning

  • icon-tool

    Business Analytics

  • icon-tool

    DevOps

  • icon-tool

    Computer Vision

  • icon-tool

    Artificial Intelligence

  • icon-tool

    Data Visualization

  • icon-tool

    Generative AI

  • icon-tool

    Docker

  • icon-tool

    Azure

  • icon-tool

    AWS

  • icon-tool

    Tableau

  • icon-tool

    GraphQL

  • icon-tool

    Flask

  • icon-tool

    Gunicorn

  • icon-tool

    Solr

  • icon-tool

    Scrapy

  • icon-tool

    GCP

  • icon-tool

    Airflow

  • icon-tool

    MLFlow

  • icon-tool

    Haystack

  • icon-tool

    LangChain

  • icon-tool

    weaviate

  • icon-tool

    GPU

Work History

9Years

Lead Data Scientist

The Weather Channel
Mar, 2023 - Jun, 20241 yr 3 months
    • I worked on building prediction models for First Party Audience. Some of the prediction models are Homeowners prediction, business travelers, PSA users prediction, T2D users prediction, Asthma users prediction, etc., for IOS, android and Web based users. Used features based on MSA, Social determinants, behavioural data, etc. Leveraged using Machine learning, Deep learning for training models. Have done a lot of EDA to find right features and soft labels. Have worked on building effective evaluation metrics along with using advanced probability based correlation on MSA data.
    • Worked on building applications using LLMs, vector databases like weaviate/elastic, RAG. Have built & evaluated search functionalities, content generation & relevance, RAG chatbots on filtered contents. (multiple usecases). Applied Generative AI on Weather article relevancy checks with multiple data points.
    • Along with using existing LLMs like openai GPT4, mistral, etc., I have finetuned LLMs like gemma for content evaluation. I have also evaluated MTEB embedding models for articles search usecases.
    • Worked on Large scale data modelling of around 400 features and 600M records. On daily basis, the client get 3M new records into the database. Worked with Huggingface, RAG, weaviate, fastapi, ragas, PySpark, EMR, Sagemaker pipelines, python, PyTorch, sklearn, Aws Athena, gin configs. fastapi. dockerization & deployment with CI/CD & jenkins. Built Gen AI frameworks & large data pipelines

Lead AI Consultant - Generative AI

Signitives
Mar, 2023 - Dec, 2023 9 months
    • Domain based Article Generation for Sports & betting companies based on brand style & language. Sports covered NFL, NBA. Use Generative AI for custom fine-tuning on styles and sport scenarios. Lead the team and product from scratch and delivered.
    • Repair Manual RAG chatbot, automobile understanding knowledge bots, Document extraction API's for auto-insurance sector.
    • Built LLM chatbots recruiter chatbot, hardware faucet stores software with RAG
    • Built article generation software with LLMs for sports media companies. Understood the domain and scenarios of the product. Proactively worked as a Gen AI product person and developed the LLM based product.
    • Built recursive candidate validation software for recruiter analytics involving GenAI and scraping for client. This will effectively extract relevant data of candidates from web and do a personality analysis.
    • Built automatic SRS document maker, refiner, troubleshooter (involves question extractor, tech stack, requirements, team mix, complexity) and Launch first estimator with LLMs
    • Fine-tuned custom LLMs on custom data for text to sql queries, tech specific code retriever (used pangucoder method), recipe maker, cricket analyst with PEFT. Worked on GPU nodes for doing pre-training and instruction fine-tuning on LLMs. Have good understanding of quantization, GGML/UF models, ollama frameworks, llama-index, langchain, advanced RAG. Have also create custom datasets with the idea of WizardLLM.
    • Have implemented customer support dialogue chatbot on custom data. Used LLMs, OpenAI GPT3.5, langchain, python, fastapi, aws, prompt engineering, pinecone for the client Have built ad recommendation pipelines and models on big data. Have used with aws sagemaker, aws EMR,
    • Aws athena, spark, python, bigdata processing, machine learning, CI/CD, jenkins, MLFlow, SQL, Athena SQL, pyspark, pytorch, aws sagemaker, extensive datanalaysis, sagemaker pipelines, deployment, dockerization - model buiding & data pipelines for the client

Senior Applied AI Scientist (Lead)

WorkFusion
Aug, 2022 - Feb, 2023 6 months
    • Implemented Table Detection and Table Structure. Detection on banking documents using Cascade TabNet, Table transformer and Layout LM. (LLM)
    • LLM's Implemented Native PDF Extraction using fitz and paddle OCR.
    • This helped the company to reduce costs on 3rd party APIs for information extraction. I have generated models with 94% accuracy on bank documents equivalent to 3rd party APIs.

AI Advisory (Part time)

LineupX
Aug, 2022 - Feb, 2023 6 months

    Worked on building AI features using GPT- 3 for ATS. Resume parser, requirement parser and search engine.

Head of Data Science

Oorwin Labs Pvt Ltd
Oct, 2018 - Aug, 20223 yr 10 months
    • Contributed to build AI platform from scratch. My use-cases & work help company reduce costs and increase Customer acquisition.
    • Worked with GPT-3, Open AI, BigBert, LLM (large language models) as well Core Competencies NLP, Python, Devops ,Data Modeling Statistics, Data Visualization, Business Analytics, Deployment.
    • Leadership: Team Building, Team Training
    • Product Development & Project Management (ATS Staffing & Recruitment Analytics, HR Analytics, CRM Analytics)
    • Analysis: Analyzing Requirements by coordinating with Clients and Product managers based on possibility, need and Usage.
    • Deployment: AI models & server Deployment, Maintenance. Dockerization of our product services.
    • Analytics: Market, Customer and Business Analytics - Choosing next use cases.
    • Recommendation Systems, Text Similarity, Document Similarity, News detection - Data Architecture And Design, Designing Approach, Building custom algorithms and Custom frameworks.
    • Pre-screening Chatbots, Employer Chatbots. Research: Computer Vision based Video Interviews via Deepface - emotion capture, video analysis, audio analysis, Face verification, speech to text, etc.

    Achievements:

    • Built Resume Parser & JD parser by training India, US and Singapore based resumes/jds considering both technical and non-technical. (A unique method for data extraction with great tools are also part of this and OCR)
    • Achieved 84% F1-score on combined parsed fields. Reduced the 3rd party api costs for the company. Currently Parsers can handle parsing of 2L resumes daily via api and data migration together. Parsers could rapidly work with a response time of 200 ms on average.
    • Implemented job scraping engine and analytics from over 65 job boards and prime vendors scraping 1000 jobs every 5 mins.
    • Built pre-screening chatbot that saves users time by reducing follow-ups and time management. Effectively used feature by over 1600 users on daily basis for capturing data of around 10000 candidates.
    • Pre-screening Chatbots, Employer Chatbots
    • Research: Computer Vision based Video Interviews via Deepface - emotion capture, video analysis, audio analysis, Face verification, speech to text, etc. Pipelines: Client Data Migration - Automation.

    Projects:

    • Resume Parser, Job Description Parser, Email Parser, Document Parser, LCA Parser - NLP Information Extraction, Intent detection, Topic modelling, Custom Entity extraction - Email Classification, Document Classification, Domain Classification, Resume Classification - NLP, Document conversions, Automatic Search Query builder.
    • Implemented Keyword Search and Semantic Search for our Candidate Ranking System - (accurate and fast indexing search) - (Advanced search capabilities and filters) Implemented Candidate Matching & Ranking, Candidate

    Analytics

    • Built Scraping Engines for our use case to grab jobs, candidate profiles, recruitment data/news - Leveraged GPU For training on high-end algorithms fastly and accurately Email verification and IMAP
    • Computer Vision for OCR related information extraction from images, pdf parser, xml parser, box detection, entity detection and recognition.
    • Data Mining, Data tagging, Data Analysis, Feature Selection & Model Selection, Model Building, Model Validation, Model threshold validation, log analysis.

    Skills used: Dockerization - docker images, docker containers, docker compose, aws app services, s3, azure app services, server performance monitoring, cron jobs, Python, flask, gunicorn, Linux/Ubuntu, APACHE SOLR CLOCLOUD, MS SQL, solr faceting, solr mlt, Banana dashboard, NLP - Information Extraction, NER, Apache Solr, Solr MLT. (solr more like this), Solr Indexing, Spark, topic modelling, SQL, Python, multi processing, Machine Learning, scikit- learn/sklearn, pandas, numpy, NLP, NER, Information Extraction, CRF, Word2vec, Fastext, Doc2vec, WMD words movers distance, spacy, gensim, pattern, textblob, Neural Networks, Deep Learning, ANN, CNN, RNN, LSTM, BILSTM, BERT, box detection, Web Scraping, scrapy, scrapy cloud, crawlera, selenium, splash, spiders, docker, linux/Ubuntu.

Data Science Mentor ( Part time)

Great Learning
Jun, 2021 - Aug, 20221 yr 2 months
    • Handled 4 batches till now (DLCP-PGP August 2021 batch, PGP-ML December 2021 batch)
    • + Course given to Jain University PGDM in collaboration with Great Learning.
    • + Course giving to IIIT-H PGDM in collaboration with Great Learning.
    • (Python, SQL, DSA, Data Visualization, ML, DL, NLP, Recommendation Systems)

Live Session Expert, Mentor ( Part time)

upGrad
Jul, 2019 - Aug, 20223 yr 1 month
    • Have mentored groups of students from 8 batches (DCS27, DSC23, MLC21, DSC14, DSC11, MLC8, DSC11 career phase, MLC8 career phase)
    • Mentoring & Training Students into Data Science, NLP
    • Building Professional Confidence to students / mentees
    • Giving Live sessions, group sessions, concept wise recordings, seminars, webinars on various AI topics
    • Doubt resolution, grading
    • Career guidance

NLP Engineer

Theatro labs
Jun, 2018 - Oct, 2018 4 months
    • Worked on Audio codec analysis (mainly for G711, Speex, Opus, AMR, Flac) for our ASR engine with respect to frame Sizes, bit rates, sample rates, audio quality, etc., for our Products / environments, requirements and end-to-end Implementations.
    • Worked on NLP chatbot using Google Dialogflow & RASA - Streaming audio and integrated it with our product using GRPC CPP / Python Application. Implemented it convert our Command based product to free flowing human language Product !
    • Log data analysis, time based analysis for Nuance Speech engines. Training Speech system with phenomes inputs.
    • Text mining / analysis using Word2vec model, CRF model,LSTM - CNN models.

    Recent implementation: Text mining / analysis using Word2vec model, Words Movers distance algorithm, Fastext model, CRF model, LSTM - CNN models, Stanford Dependency parsers. Also implemented BIDAF (Bidirectional attention flow model) of Stanford Squad dataset

NLP Engineer

Senseforth AI Research Pvt Ltd
Feb, 2018 - Jun, 2018 4 months

    Responsibilities-

    • Developed Chatbots using NLP / NLU - GATE text Engineering ? Extracting intents/entities from text (randomized sentences written in different ways), scoring, Making synonym module, antonym module, randomized Time and date extraction module, phrase extraction Modules, etc. Integrated all to my chatbot framework. Worked on this for Banking, Telecom, IT-Support, Finance Domains.
    • Implemented MIS Chatbot, which in summary stores Context of previous queries of the user and also shows Visualization for the finance related queries from the Database. (Here, we find out the intent of randomized Finance query, make an SQL query using that intent and Extract data from database and show visualization to user. Based on the user query, our chatbot can decide to reply or Show data or show visualizations)
    • Made and used vast number of modules using GATE at Senseforth for intent / phrases identification from query Texts and documents.
    • Also worked with word2vec, fastext, WMD, dependency Parser for text analysis using Python.

Senior Systems Engineer - Data Scientist

Infosys
Jun, 2015 - Jan, 20182 yr 7 months
    • Worked on Candidate Rankings and Candidate Comparison use case using Machine learning Random forest for our client data. Created a Tableau visualization for it.
    • Worked on clustering use-case for telematic data to Analyze achievable profits for the client. Worked on telecom Data to analyze achievable profits for the client. Prescriptive analytics. - Worked on Email Sentimental Analysis problem for my client using Bag-of-words and Random forest model (Text Classification)
    • Worked on fraudulent claim predictions, client subscription. Analysis. - predictive analytics. - Worked on Chiller plants Maintenance data using Linear Regression Analysis, Exploratory Analysis, Association Analysis to get the insights From data using R, Python and thereby detecting spikes of The plant.- Descriptive analytics, EDA.

    Personal Implementations:

    • First Chat-bot : Made a simple FAQ Chatbot using cosine similarity, python- NLTK, pandas, numpy libraries. Also used web scraping / crawling to get live data.
    • Solved Loan prediction problem using Decision Tree/Random forest algorithms.
    • Extracted & Worked of twitter data, facebook data to do exploratory analysis.

    Software Engineer- (June 2015- 2016)

    • Worked on solving Cross-Site Scripting Vulnerabilities using Cold Fusion technology.
    • Worked on developing and enhancing the features of web applications using ASP.NET 2016
    • Worked on customer data and its intricacies using MS CRM and MS NAV tools. -2016
    • Learnt python (ds,oops), dbms, integration of py & dbms
    • .Net - C#(basic & advanced), webforms, ADO.NET, ASP.NET MVC etc.
    • Worked on developing python applications for airline industry.
    • Worked on developing ASP.NET (MVC, webforms also) based applications for insurance Industry.

Achievements

  • Oorwin Awards - Awarded for building NLP Parsers effectively in short span with atmost metrics that helped company reduce heavy costs
  • Data Science Mentor & Industry Tutor Mentored & Taught around batches (9 month courses) on weekends in Upgrad & Great Learning out of interest in teaching subject rightly AI advisory Worked as AI advisory for startups Spiritualist Have great interest in philiosphy and service for higher purpose.
  • Promoted to Head of Data Science for building AI platform, research & strategy
  • Going above & beyond award National Talent Search Exam - NTSE Achieved State wide 3rd rank in NTSE Exam and qualified Nationals
  • Awarded for building NLP Parsers effectively in short time
  • Promoted to Head of Data Science for building AI platform
  • State wide 3rd rank in National Talent Search Exam (NTSE)

Education

  • MBA/PGDM

    Aegis School of Data Science (2017)
  • B.Tech/B.E

    SRM university (SRMU) (2015)
  • B.Tech/B.E.

    SRM university (SRMU) (2015)

Certifications

  • Courses & Certifications

  • Product management from institute of product leadership (ipl)

  • Blockchain - advanced distributed ledger technology from iiit hyderabad

Interests

  • Watching Movies
  • Driving
  • Games