Vishal J.

Vishal J.

Data Engineer with Strong knowledge on AWS, Python, SQL, BigData

Bengaluru , India

Experience: 4 Years

Vishal

Bengaluru , India

Data Engineer with Strong knowledge on AWS, Python, SQL, BigData

USD / Year

  • Start Date / Notice Period end date:

4 Years

Now you can Instantly Chat with Vishal!

About Me

I have strong experience with the Following technologies 
Programming Languages:  Python, SQL,
Python libraries: pandas, NumPy, scipy, pytz, matplotlib, os, csv, pyscopg2, mysql.connector, sqlalchemy, p...

  • Building end-to-end data pipeline from scratch using Python and AWS services 
  • Writing clean and optimize AWS lambda functions using python. 
  • Building the lambda packages
  • Converting server application to serverless 
  • Developing the new functionalities for the project 
  • Production deployments

Projects
Document Digitization (extracting data from pdf and convert it into the csv, json, xml format)

  • Build the end-to-end data processing pipeline using AWS Lambda, s3, sqs. 
  • Developed the automated pipeline for reporting using AWS lambda, PostgreSQL, python, SES

PDF data extractor (extract the data from pdf)
        build a complete end to project with backend and front end

  • For the backend I have used flask, python, pandas, openCV to extract the data and the Mysql database to store this data 
  • For the frontend, I used the bootstrap, html, css, jinja templating language
  • for hosting this website I used the aws ec2 instance

Predictive works ( The aim of the project is to predict the anomalies before occurring it)

  • In this project, I have build end to end data collection ETL pipline 
  • This pipeline fetches the data from different databases and stores it in the s3 bucket
  • The code host on the AWS lambda and written in python it fetches record from different production database and store in s3 bucket 

Show More

Portfolio Projects

Description

Document digitization

  • extracting data from pdf and convert it into different format such as JSON, XML, csv as per customer requirement
  • building end to end pipelines
  • building automation piplines for reporting

Show More Show Less

Description

Design a database schema, Writing models, view/functionality for the application

● Hosing the application on AWS - ec2 windows instance, RDS.

Show More Show Less

Description

The aim of the project is to predict the major incident before occurring.

● Data Collection from different databases

● Writing an ETL script using python.

● Creating an end to end data pipelines.

● Hosting a project in a cloud with Aws services like- S3, ec2, Lambda, Batch.

● Building a Docker image for AWS Batch

● Design a Memory management module using python for AWS lambda.

Show More Show Less

Description

Setup the ELK Stack on the production system.

● Building a single and multimeric machine learning job using Elasticsearch, Logstach, and kibana.

Show More Show Less

Description

Migrating in premises system/setup to cloud.

Migrating Ec2 instances, Dynamo DB tables, elastic cache from one account to another account(STG to PRD) using the AWScloud formation scripts.

● Cassandra Production deployment- Creating, configuring and adding the new Cassandra node into the existing ring

● Jira Administration - Create/update/modify Screens, Workflow, fields, and projects

● Chef installation on different AWS servers

● AWS Trust advisor: - Reduce the billing of the existing system

Show More Show Less