profile-pic

Aswathy Raj

Vetted Talent

Aswathy Raj

Vetted Talent

Diligent engineer with 12+ years of experience which includes contributions in data science and engineering,

development of software framework, platforms, applications and customer interaction with multilingual and

multicultural clients. An effective team player and well versed in various platforms, programming languages

and programming with different databases. Also have extensive experience in all phases of software

development, and on waterfall and agile methods of project life cycle.

  • Role

    Senior Data Engineer

  • Years of Experience

    12.00 years

Skillsets

  • Nsis
  • Airflow
  • Ant script
  • Aws
  • Business Intelligence
  • Ci/ cd implementation
  • Client Management
  • Databricks
  • Eclipse
  • Github
  • Implementation support
  • It infra management
  • Jira
  • Jupyter notebook
  • MySQL
  • Data Integration - 4 Years
  • Providing product demo
  • PyCharm
  • Reporting & documentation
  • Requirement Gathering
  • SQLite
  • Svn
  • VC++
  • Visual Studio
  • Data insights & strategy
  • Data analytics dashboard
  • Support software development
  • Tender proposals
  • Jaspersoft ireport designer 5.1.0
  • SQL - 8 Years
  • Azure DevOps - 1 Years
  • ETL processes - 4 Years
  • Data Modeling - 4 Years
  • Snowflake - 4 Years
  • ADLS Gen 2.0 - 1 Years
  • Java - 2 Years
  • PostgreSQL - 0.5 Years
  • Python - 4 Years
  • Mssql - 2 Years
  • Data Migration - 4 Years
  • AWS Expertise - 4 Years
  • Etl development - 4 Years
  • SQL Proficiency - 8 Years
  • Azure Data Factory - 1 Years
  • Redshift - 1 Years
  • NoSql - 0.5 Years
  • ETL - 4 Years
  • S3 - 4 Years
  • Asp.net
  • C++
  • Pyspark
  • Vc++
  • Ant script
  • Tcl/tk script
  • AWS - 4 Years
  • large datasets - 4 Years
  • DBT - 1 Years

Vetted For

11Skills
  • Roles & Skills
  • Results
  • Details
  • icon-skill_image
    Senior Data EngineerAI Screening
  • 62%
    icon-arrow-down
  • Skills assessed :BigQuery, AWS, Big Data Technology, ETL, NoSql, Pyspark, Snowflake, Linux, Problem Solving Attitude, Python, SQL
  • Score: 56/90

Professional Summary

12.00Years
  • Oct, 2020 - Present4 yr 6 months

    Consultant | Data Science Engineer

    Sinergia Media Labs
  • Jan, 2013 - May, 20163 yr 4 months

    IT Consultant

    AI Rawahy Technical Services
  • Sep, 2006 - Nov, 20104 yr 2 months

    Software Engineer

    Huawei

Applications & Tools Known

  • icon-tool

    Airflow

  • icon-tool

    Pycharm

  • icon-tool

    Jupyter Notebook

  • icon-tool

    Eclipse

  • icon-tool

    Visual Studio

  • icon-tool

    Jupyter Notebook

  • icon-tool

    GitHub

  • icon-tool

    SVN

Work History

12.00Years

Consultant | Data Science Engineer

Sinergia Media Labs
Oct, 2020 - Present4 yr 6 months
    Developing machine learning applications according to requirements along with selecting appropriate datasets and data representation methods Researching and implementing appropriate ML algorithms and tools Running machine learning tests and experiments along with performing statistical analysis and fine-tuning using test results Developing and maintaining databases, and data systems reorganizing data in a readable format Filtering Data by reviewing reports and performance indicators to identify and correct code problems Using statistical tools to identify, analyse & interpret patterns in complex data sets Preparing final analysis reports for the stakeholders to understand the data-analysis steps, enabling them to make important decisions based on various facts and trends

IT Consultant

AI Rawahy Technical Services
Jan, 2013 - May, 20163 yr 4 months
    Planning project activities viz., scoping, estimation, tracking, change management and post-implementation support Managing end-to-end project management including project initiation, pre-sales, business proposals and responding to RFPs Handling various technical aspects like project documentation, system design & integration, coding of modules, monitoring critical paths & taking appropriate actions for multiple projects Imparting training to others on CI/CD programming languages

Software Engineer

Huawei
Sep, 2006 - Nov, 20104 yr 2 months
    Performing design and development of functional and technical solutions by evaluating the CTQs performing portfolio analysis and conducting risk management Removing corrupted data and fixing coding errors and related problems Excelling in rapid application development and management of technological issues for assigned projects, earning the highest customer satisfaction rating for all software solutions delivered Compiling the code, running lines, and performing static code analysis Executing unit tests, integration tests and end-to-end tests Defining pipeline steps in code and using Docker to create consistent build environments Integrating code changes frequently and automatically build and test code on every commit to catch

Achievements

  • Awarded for the Tableau Dataset Migration (Customer Appreciation)
  • Achieved CEO Team Award in January 2023
  • Bagged Monthly Shining Star Award for exceptional performance under pressure, meeting strict timelines, and delivering quality results during March 2022
  • Successfully worked in China for 6 months to implement urgent requirements, completed tasks with high-quality
  • Awarded for contributions towards making projects CI (Continuous Integration) compliant, including setting up Cruise Control, writing scripts in ANT, XSL, and XML, providing technical assistance, and conducting training sessions

Major Projects

8Projects

Techstyle

Mar, 2024 - Present1 yr 1 month
    Techstyle is an American fashion brand that operates in the e-commerce domain. Technologies AMWAA, Python, SQL, Snowflake, MS SQL Server, Pycharm, GitHub Accountabilities: Continuously oversee active processes, promptly identifying and resolving any issues or failures to ensure seamless operations. Design and implement new features and data pipelines to enhance functionality and efficiency. Conduct rigorous data validation to ensure accuracy, consistency, and integrity across all datasets. Troubleshoot and fix any failures, optimizing system performance for improved reliability and speed.

NBC (National Broadcasting Company)

Nov, 2021 - Feb, 20242 yr 3 months
    National Broadcasting Company is an American commercial broadcast television and radio network. Technology Used: Python, PySpark, SQL, Amazon S3, Databricks, Airflow, Snowflake, MySQL Accountabilities: Developed frameworks and pipelines for capturing data from APIs and other sources, storing it in Amazon S3 and loading it into Snowflake tables after transformation Optimization and migration of Tableau datasets - PySPark code Managed migration process from SnapLogic to Airflow and Python, implemented distributed processing with PySpark and Spark SQL in Databricks Added and maintained ETL pipelines in Airflow and optimized Spark SQL queries to reduce reporting query run times Created and managed Delta tables and evaluated data optimization technologies Developed an audit framework integrated with Python scripts Imparted training to the new hires on domains and pipelines

Social Pulse

Mar, 2023 - Sep, 2023 6 months
    In-house project to leverage the data from various social media endpoints like Youtube, Facebook, Instagram, Twitter, LinkedIn, and TikTok, to provide a reporting dashboard. Technology Used: Python, Redshift, Amazon S3, AWS QuickSight, React, node.js Accountabilities: Architected & managed the project until completion & ensured the development of the framework, and pipelines for data capturing from AP

Amgen

Jun, 2021 - Oct, 2021 4 months
    Amgen is an American multinational biopharmaceutical company. The data science project was carried out to identify the factors leading to customer/patient dropout of one of their drugs Otezla. Technology Used: ML, Python, SQL | Platforms: Databricks Accountabilities: Analysed data in a Data Lake with over 300 tables to understand the pharma domain Prepared two aggregated datasets: one at the customer level and another at the patient level Conducted Exploratory Data Analysis, handled missing values and encoded categorical data Performed feature engineering for feature elimination and developed 12 machine-learning models for classification and clustering Created an ML pipeline for model retraining

Indventor

Oct, 2020 - Jun, 2021 8 months
    Indventor Bag Valve Mask-based low-cost ventilator which is the standard method of providing rescue ventilation to patients.

In-House Project, Indventor

Oct, 2020 - Jun, 2021 8 months
    Indventor Bag Valve Mask-based low-cost ventilator which is the standard method of providing rescue ventilation to patients. Technology Used: Python, SQL, Selenium Accountabilities: Reviewed the documents & code and presented client-side product presentation Researched features as per customer request and created UI path RPA flows

Ministry of Agriculture and Fisheries, Oman

Jan, 2013 - May, 20163 yr 4 months
    The Ministry of Agriculture and Fisheries is initiated to enrich the fields related to agriculture, livestock and fisheries. The project aimed to centralize the data from various regions. Technology Used: Core Java, Jaspersoft iReport, Windows / Software configurations Accountabilities: Understood the project architecture and the functionality of the Fisheries Licensing module Interacted with the ministry to clarify requirements, ensured alignment & conducted legacy database data analysis for migration to a new database Designed & created license cards, certificates & statistical reports using iReport, and integrated these reports into the application Deployed the database and application on the ministry's centralized server.

Security Solutions, Huawei

Sep, 2006 - Nov, 20104 yr 2 months
    Huawei is an organization worldwide known for its work in telecommunication. The project aimed to enhance the security offered at the IP layer. The product contains support from IKEv1 as well as IKEv2. I worked on a project which developed applications to enhance the security of the telecom servers. Technology Used: SQLite, C++, tcl/tk, Core Java Accountabilities: Built projects to enhance the code quality by developing an on-the-fly feedback and correction system for eclipse systems. Created CI/CD pipelines for projects in co-operating command mode integration of code quality and QA tools. Provided training for the team for building continuous integration systems for projects. Developed automation suites for building libraries across various platforms and boards Implemented and managed Continuous Integration (CI) processes and conducted training for the project team Developed and implemented GUI-specific code along with analysing new requirements and designing solutions for implementation Enhanced coding skills in Core Java and Swing and gained proficiency in Oracle database administration Extended customer support for LGT, LVM & LMT and implemented logging and auditing policies Created XML configuration files based on CIS Benchmarks Parsed, retrieved, and wrote XML configuration files and conducted training sessions on using the plug-in

Education

  • M. Tech. (Data Science and Engineering)

    BITS Pilani, India (2022)
  • B.Tech. (Computer Science and Engineering)

    MG University, India (2006)

Certifications

  • Data warehousing workshop - snowflake - october 2024

  • Academy accreditation - databricks lakehouse fundamentals - march 2023

  • Basics of natural language processing using python - march 2021 (nielit)

  • Databases and sql for data science by ibm (coursera) - nov 2019

  • Introduction to git and github by google (coursera) - october 2020

  • Exploratory data analysis with python and pandas (coursera) - march 2021

  • Data warehousing workshop - snowflake - october 2024 (credential id 119306090)

  • Basics of natural language processing using python - march 2021 (nielit) - (credential id olc3190)

  • Exploratory data analysis with python and pandas (coursera) - march 2021 (credential id yv8396ns2l25)

  • Databases and sql for data science by ibm (coursera) - nov 2019 (credential id - usxtmtufvyt8)

  • Introduction to git and github by google (coursera) - october 2020 (credential id - nuwxqp5a3gte)