Senior Data Engineer
Jones LaSalle Lang Technologies(JLL), IndiaFeb, 2019 - Dec, 20201 yr 10 months
Property Web Based Product. Worked on multiple API source Ingestion, dump schema creation and entity modelling using Cosmos and Scala Azure Functions. Worked on global multi region sources and associated rule based implementation of Spark Azure Databricks notebooks driven etl region specific pipelines. Integrated entities in the property domain, using Azure Cosmos Graph and Azure Databricks Notebooks, followed by Scala web-service APIs deployed on Azure HDinsights for quick search. Worked on Streaming data Application element of the pipeline, detecting refreshes. Competitive analytics platform. Designing of, individual table based schema handling, ingestion and implementation of a data warehouse for KPI tracking, and its respective components for a full edged reporting data- warehouse. Created spark jobs for handling of daily data from Mongo, MySQL, Postgres and Folder dumps to update the data warehouses, using Airflow scheduling. Managed scaled ingestion from public competitor apis for tracking relevant parameters in analytics warehouse on Redshift. Worked on complex custom reporting spark logic driving insightful marketing strategy. Benchmarked the real-time elements of the solution with Kafka Streams.