Basant J.

Basant J.

Senior Data Engineer

Bengaluru , India

Experience: 7 Years

Basant

Bengaluru , India

Senior Data Engineer

52655.7 USD / Year

  • Notice Period: 30 Days

7 Years

Now you can Instantly Chat with Basant!

About Me

Big Data Engineer having 6+ years of experience in IT industry and Proficient in Python, Scala, Big Data, Hadoop, Hive, Map Reduce, Spark RDD, Spark SQL, Shell Script, Airflow, Snowsql, AWS EC2, S3, Glue, EMR, Microsoft Azure Blob, VM, Data Factor...

Show More

Portfolio Projects

Data ingestion and transformation from different sources using Big Data for US Retail Client

Company

Data ingestion and transformation from different sources using Big Data for US Retail Client

Role

Backend Developer

Description

  • Created Big Data Pipeline and Implemented Spark based Model for batch processing of data.
  • End to end product features development.
  • Ingesting data from sftp server to snowflake via aws s3 using JDBC connection
  • Ingesting data from sftp server to postgress via snowflake using REST API
  • Trained interns for Big data Technologies.
  • Developed pyspark code to validate data and load data to snowflake from s3  Ingesting data from Azure blob and loading the data to Postgress . Schedule the entire flow using Apache Airflow

Show More Show Less

Tools

Github

Data Ingestion And Transformation From Different Sources Using Big Data in Banking Domain

Company

Data Ingestion And Transformation From Different Sources Using Big Data in Banking Domain

Description

  • Ingest data from sql database to hive with business transformation.
  • Script creation to generate reports for BI team from Hive using HIVEQL.
  • Contributed in enhancements and performance improvement of the process.
  • Involved in performance improvement activity in Hive with Joins,Group and aggregation
  • Created automation process to schedule all scripts using Oozie/Falcon.

Show More Show Less

Tools

PyCharm

PNDA-PlatformforNetworkDataAnalytics

Company

PNDA-PlatformforNetworkDataAnalytics

Description

PNDA is an Open source Platform for Network Data Analytics.Efficiently distributes data with publish and subscribe model.Processes bulk data in batches,or streaming data in real-time.Worked as a coredeveloper of PNDA. Have added new features,fixed manyi ssues.Developed Kafka custom producers, developed some batch and stream spark applications with python for PNDA

 

 

 

Show More Show Less

Tools

PyCharm
Share:

Verifications

  • Profile Verified

  • Phone Verified

Preferred Language

  • English - Fluent

Available Timezones

  • Eastern Daylight [UTC -4]

  • Central Daylight [UTC -5]

  • Mountain Daylight [UTC -6]

  • Pacific Daylight [UTC -7]

  • Further EET [UTC +3]

  • Greenwich Mean [UTC ±0]

  • Eastern EST [UTC +3]

  • Eastern European [UTC +2]

  • Australian EDT [UTC +11]

  • Australian CDT [UTC +10:30]

  • Dubai [UTC +4]

  • New Delhi [UTC +5]

  • China (West) [UTC +6]

  • Singapore [UTC +7]

  • Hong Kong (East China) [UTC +8]