Site Reliability Engineer | Online Jobs | Optimhire

Site Reliability Engineer

Mandatory skills: AWS, Linux & NewRelic


Responsibilities:

  • Help drive initiatives for designing, upgrading and scaling our architecture to improve availability, reliability, and performance.
  • Engage with engineering teams throughout the SDLC to help develop software for reliability and scale, with a shift-left mentality, ensuring minimal refactoring or changes
  • Participate in incident management and drive RCA and problem management
  • Build and maintain tools that our developers use to observe their applications
  • Write scripts to automate the provisioning, monitoring, logging, and maintenance of systems in a distributed, AWS hosted infrastructure
  • Work in an agile development environment, collaborating with Application Development and Architecture teams to implement and evolve solutions for a wide array of challenging projects
  • Develop solutions for technology, processes, people, and practices for build and release management, application lifecycle changes, operational service delivery, and support using the technologies mentioned above
  • Take an active role in developing the long-range highly-available technical infrastructure and architecture plans for cloud (AWS) infrastructure
  • Experiment with new technologies to optimize the reliability and performance of our software and hardware infrastructure
  • Experiment with new technologies to provide application health visibility to product stakeholders
  • Design and implement proactive monitoring to ensure the health, performance, reliability, and security of our environments
  • Plan, test, and monitor automated backups and disaster recovery/failover configurations
  • Produce high-quality installation and configuration documentation and processes
  • Participate in a 24x7 on-call rotation for all supported technologies and be available in the rare case of an off-hours problem
  • Reduce toil in regular infrastructure management tasks


Requirements:

  • At least two years of experience working as DevOps and/or Site Reliability Engineer
  • At least two years of experience working with AWS
  • Experience with deployment toolchains such as Jenkins or Gitlab-CI
  • Experience required with anyone logging tools like NewRelic, coralogix, elk or sumologic
  • Experience with any one monitoring tool like Prometheus and Grafana or NewRelic, Cloudwatch
  • Experience with version control system Git
  • Proficient with Unix/Linux terminal or Powershell console
  • Proficient in one or more languages such as Python or bash.
  • Knowledge of both Linux operating systems
  • Strong problem-solving and critical thinking skills
  • Excellent verbal/written communication skills and strong interpersonal skills to interact professionally and courteously with end-users and co-workers
  • Passion for learning new information and technologies


Job Type

Payroll


Positions

DevOps Engineers


Must have Skills

  • AWS - 2 Years

    Advanced

  • Linux - 2 Years

    Advanced


Languages

english - Fluent

hindi

6 - 11 K/Year USD (Annual salary)

Longterm (Duration)

Partially Remote Noida, Uttar Pradesh, India

India


Abhishek c

Payment Verified India