About Me
Working as associate consultant in a reputated MNC on big data technology like spark/scala from last 3.5 year.
... Show More
Skills
Web Development
Data & Analytics
Programming Language
Database
Operating System
Development Tools
Others
Positions
Portfolio Projects
Company
Renewables and Logistics
Description
Roles and Responsibilities:
· Implemented newer concepts use like Apache Spark and Scala programming
· Managing data coming from 200+ different sources
· Loaded unstructured data into Hadoop File System( HDFS)
· Written validation and data quality scripts
· Implementation of Cloudera cluster with High availability and standby solutions.
· Design and support of Data ingestion, Data Migration and Data processing for BI and Data Analytics.
· Responsible for developing data pipeline Sqoop and pig to extract the data from weblogs and store in HDFS.
· Involved in developing Pig Scripts for change data capture and delta record processing between newly arrived data and already existing data in HDFS.
· Worked on HIVE- Integration-Spark SQL scripts for performance enhancement
· Worked on DataFrame Development as part of SparkSQL.
Show More Show LessTools
SVN