About Me
I am a big data developer having skills in Hadopp, Hive, Spark, Kafka and programming languages such as Scala, Python and Linux shell scripting. I am looking for challenging projects in Big data and data science
... Show MoreSkills
Web Development
Data & Analytics
Database
Others
Programming Language
Operating System
Positions
Portfolio Projects
Company
Telecom Data Streaming
Description
This project is regarding streaming the application data from HDFS and send to the end user directory for 8 application feeds. The Spark Streaming application is developed using Scala in a Generic Extraction Framework (GEF) model, which processes data by referring the parameters in property file.
Show More Show LessCompany
Telecom Customer Report development
Description
This project is regarding generation of Customer Score reports at different granularities in Hive based on the customer data. The customer report generated is integrated with Tableau to visualize the information generated in the system. The logic present in Hive is being migrated to Pyspark in order to reduce the execution time of the entire process and also to handle more complex data faster.
Show More Show LessTools
TableauCompany
Data Warehouse development
Description
This is regarding the design of data pipeline for the car data for different parts which are used to compare with the original data to identify the defects present in the vehicle. The data will be prepared and will be deployed in AWS cluster.
Show More Show LessSkills
Apache Spark AWS EMRTools
IntelliJ IDEA