Lead DevOps/Site Reliability Engineer

Lead DevOps/Site Reliability Engineer

Infrastructure

Job Description
Lead DevOps/Site Reliability Engineer
THE OPPORTUNITY:
This is a great opportunity to create a real impact using modern technologies and cloud infrastructure. The ideal candidate for this role is quick learning and enjoys taking on the challenge.
KEY RESPONSIBILITIES
  • Build and lead a growing team of high-performing engineers by providing strong leadership
  • Help define and enforce DevOps standards, procedures, and guidelines to improve the overall development process
  • Provide technical leadership and guidance to both your team members and your peers
  • Cultivate a modern DevOps culture and mindset throughout your teams and establish performance metrics and processes to improve velocity and deliverability
  • Deploy and support high-performance, highly-available infrastructure on AWS
  • Engage in service capacity planning and demand forecasting, software performance analysis and system tuning
  • Conduct periodic on call duties
  • Own the day-to-day health, uptime, monitoring, and reliability of our cloud infrastructure
  • Work closely with engineering and project management to develop tools and solutions
  • Identify tactical issues and react to emerging areas of concern
  • Think long-term and dislike band-aids
  • Identify unnecessary complexity and remove it
REQUIRED QUALIFICATIONS
  • 5+ years experience in a SRE/SysAdmin, DevOps, or equivalent role
  • 3+ years of production AWS experience, including but not limited to EC2, RDS, ElastiCache, S3, Auto Scale, ELB
  • 2+ years of experience in technical leadership role managing DevOps in Agile environments (Scrum)
  • Capability to program in at least one language (other than Bash), ideally Python, PHP or Java
  • Solid knowledge of the Linux architecture
  • Experience with container stacks and orchestration (Docker, Kubernetes, Mesos, ECS, etc)
  • Experience with Configuration Management tools such as Puppet, Chef, Salt, or Ansible
  • Experience with Infrastructure as a Code frameworks such as Terraform
  • Experience with Microservices
  • Experience with application monitoring tools (DataDog, New Relic, SumoLogicetc)
  • Strong knowledge of core protocols and tech such as: TCP/IP, HTTP, DNS, load balancers, distributed file systems, key-value and relational databases
  • Excellent organizational skills and the ability to work in a fast-paced work environment
  • Experience with CI/CD tools and production code deployments