Yogesh J.

Yogesh J.

Data Engineer

Pune , India

Experience: 9 Years

Yogesh

Pune , India

Data Engineer

40036.8 USD / Year

  • Notice Period: 45 Days

9 Years

Now you can Instantly Chat with Yogesh!

About Me

  • Over 9 years of IT experience, with around 7 years of experience in Spark and Hadoop Ecosystem and 3 years of experience in Web and back end development using Scala and Play-Framework.
  • Expertise in concepts of end-to-end projec...
  • Expertise in concepts of end-to-end project planning and implementation, release based maintenance, custom application development, enterprise wide application deployment, testing support.
  • Hands on experience on major components in Big Data Ecosystem like Spark.
  • Handas on experience on AWS cloud ( EMR, APPFlow, DMS, Redshift, S3, CloudFormation, Cloudwatch, Athena )
  • Experienced in processing Big data on the Apache Spark framework.
  • Excellent understanding and knowledge of NOSQL databases like Mongo DB.
  • Experience in working with UNIX/LINUX platform.
  • Experience in giving training and guiding new team members in the Project.
  • Experience in detailed system design using use case analysis, functional analysis, modelling program with class sequence, activity and state diagrams using UML.
  • Proficient in Retail and Media platform.
  • Worked on Facebook marketing API's.
  • Very good experience in customer specification study, requirements gathering, system architectural design and turning the requirements into final product.
  • Experience in interacting with customers for testing of products and services.
  • Ability to work effectively with associates at all levels within the organization.

 

Experties on  following technology:

Scala, Spark, MongoDB, Redshift, AWS and GCP, Python.

 

Show More

Portfolio Projects

Customer Data Platform

Company

Customer Data Platform

Description

Technology :

                Scala, Spark ( EMR/DataProc ), AWS, Redshift, GCP, BigQuery
Description :

               This product is enterprise customer data platform, customized for Client (the "Client Intelligence Platform", or "CIP") that will permit Client to create a 360-degree view of its customers. In this product collect client raw data and processed in AWS and GCP environment using spark,graph-x library to identify unique customer for specific client and run aggregation on that unique customer for further analytic. All processing handled in AWS and GCP using micro services and respected output stored in redshift and Bigquery.

Role :

       Work on creating python flask based micro services to trigger EMR cluster and send job details to EMR step, Implemented spark graphx library to identify unique user and maintained graph for that users.

Show More Show Less

Job Portal

Company

Job Portal

Description

Technology :

             Scala, MongoDB, Play Framework, Apache tika, GATE
 Description :

            The main focus of this products is parsing resume and analytic. Parsing in the sense of employee/consultancy upload resume and systems parsed all resume parameters and stored in DB. Employer can post jobs and employer have facility to find out top matched resumes using predefined algorithm, employer can pull resumes from other job portals. Employee can apply jobs, build analytic for displaying top matched jobs according to profile and using previous job apply history, displayed employer lists ranking wise using predefined algorithm.

Role :

         Using scala and play framework created website where register user can upload resume and based on internal algorithm identified user information like email, mobile number, skill sets, experience level, organization name etc. and stored that information in MongoDB.

Show More Show Less

Network monitoring product

Company

Network monitoring product

Description

Technology:

         Scala, Mongodb, Play framework, HTML5, CSS3, JavaScript

Description:

         It is a network monitoring solution. It collects real data in batches and can monitor private network. It can monitor Linux, Windows, vCenters, ESX, storage devices in network. It collects many parameters like CPU usage, top processes, etc. which a network administrator requires. The website displays collected data with the help of graphs.

 Role :

        Worked on to formatting incoming raw data, process that data and stored in MongoDB for further used, worked on designating mongodb collection structure. Implemented MongoDB sharding.

 

Show More Show Less

Chat Bot application

Company

Chat Bot application

Description

Technologies:

              Node-Js, Python, API.ai, MongoDB

Description:

        This chat bot gives all tax related information to user, where customer uploads there question and answers (FAQ) in excel/CSV file format and then python code process those data remove all stop words, identify Noun, Verbs, intent in sentences and based on  those data process NLP and show top matching response to user in chat box window.  


Role :

Using python NLP method identified given question meaning and based on some threshold limit fetch appropriate results from data base and show to user, also created mongo collection structure.

Show More Show Less