ARNOLD D.

ARNOLD D.

Sr. Big Data Developer/Engineer

Tampa , United States

Experience: 20 Years

ARNOLD

Tampa , United States

Sr. Big Data Developer/Engineer

153600 USD / Year

  • Immediate: Available

20 Years

Now you can Instantly Chat with ARNOLD!

About Me

Highly analytical Big Data Engineer/Developer/Architect with extensive experience in Big Data/Hadoop development and Scala programming. Enthusiastic team player with over 20 years of designing and developing innovative solutions to unusual and diffic...

Show More

Portfolio Projects

Description

4/2019 to 01/2020 Sr. Big Data Developer/Engineer (Remote Big Data Support)

Kelly Mitchell for Bayer – Maryland Heights, MO

  • Develop and maintain Big Data ETL/ELT pipelines to ingest large data from heterogenous data sources through AWS EMR with Hive and Spark to improve data ingestion process by almost 50%.
  • Improved process for several Data Engineering task to de-duplicate, transform, cleanse and enriched data for Data Science team.
  • Successfully advocated the migration of Apache Hive data processing to Apache Spark Scala to improve data processing to as much as 90% run time.
  • Significantly improved overall code quality, code coverage and code reusability by adapting TDD and Clean Code principles and processes.

Show More Show Less

Description

10/2018 to 4/2019 Sr Big Data Developer/Engineer

Iris Software for Citi – Tampa FL

  • Develop and maintain Hadoop ETL/ELT framework to ingest large data from desperate data sources using Hive and Spark to improve run-times performance reduce run time by 25%.
  • Successful transition on the use of Scala and Spark to improve productivity and testability in developing concise and testable code adhering to the clean code principles which significantly reducing bugs by up to 80%
  • Evangelization on the TDD and Clean Code principles for Scala and Spark to improve productivity in developing concise, testable, sustainable code.

Show More Show Less

Description

10/2016 to 10/2018 Sr Software/DataArchitect/Engineer (Remote Big Data Support)

T-Mobile – Bellevue, WA

  • Successful Evangelization on the use of TDD and Clean Code Scala and Spark to improve productivity by twice (2x) in development with fewer bugs when its deployed to production after achieving 80% code coverage.
  • Improved data delivery to internal customers including data scientist by as much as tenfold (10x) in the Fastdata Platform using SMACK stack to process multi-petabyte size data.
  • Develop and maintain Hadoop ETL/ELT framework to ingest large data from desperate data sources using Hive,Tez, and Spark to improve run-times up to 50% of existing code
  • Development of Machine Learning, AI application in the Azure and AWS cloud to for self-service data delivery to increase productivity 30% and reduce manpower for service desk by 40%

Show More Show Less

Description

04/2014 to 04/2015 Sr Software/Big Data Developer V (Remote Big Data Resource)

JMA International for T-Mobile - Snoqualmie, WA

  • Designed, Developed, Tested and Implemented Java, Scala application to ingest large data from different sources utilizing Hadoop distributed file system (HDFS), running Map/Reduce, Spark, Impala to improve data processing by 40% from legacy Perl ETL framework.
  • Implemented Spark, Impala, Hive, HBase, HDFS performance tuning to achieved to as much as 100% improvement in query and Map/Reduce jobs execution time.
  • Performedsystem monitoring and management of Hadoop cluster, RHEL Linux collection cluster and Apache Web server.
  • Created prototyping/proof of concept application for Business Intelligence (BI) Platform with Tableau and Zoomdata.

Show More Show Less