Senior Site Reliability Engineer

Req ID: POS-48864

Apply Now

Person, Wood, Plywood, Clothing, Sitting, Indoors, Room, Furniture, Table, Long Sleeve

Senior Site Reliability Engineer

  • Ft. Lauderdale, Florida, United States

*** This role will be on site in Ft Lauderdale, FL**

You are a Senior Site Reliability Engineer that has experience managing and deploying cloud infrastructure at scale within a public cloud environment. You have worked with Infrastructure as Code and have exposure to at least one of the major cloud providers (Azure, AWS, or GCP). If you have 4+ years of SRE experience and are well versed with app development and scripting, we’d love to speak with you!

Position Overview:

In this role, which sits on the Zero Trust Network Access Team, you will play an important role in supporting the operations of large-scale, highly-available Distributed Systems. You will have an opportunity to leverage your SRE skills to ensure system administration and operations processes are running efficiently. You will be responsible for deployment automation, deployment architecture, custom operations tool development, performance scalability, cost scalability, monitoring/alerting designing/implementing for all things cloud related! This position will work out of the Fort Lauderdale, FL office.

Role Responsibilities:

  • Develop scripts and provide hands-on technical expertise to design, deploy, and optimize Cloud services

  • Improve the Security, Availability and Performance of the systems you build while managing Cloud Costs (COGS) and maintain Compliance guardrails

  • Build automation using industry tools (such as Jenkins, TeamCity, Ansible, etc.) to deploy hundreds of different services

  • Promote and contribute to best practices in library usage and end-to-end architecture

  • Work with other development teams to design scalable, robust systems using cloud native architecture principals

  • Identify and address patterns in infrastructure and applications that can be solved with a common solution

  • Participate in a 24x7 on call rotation to ensure cloud service availability

Basic Qualifications:

  • Experience managing deployments with Infrastructure as Code

  • Experience managing Cloud services and distributed systems – deployment, monitoring, scaling, debugging within one of the major public Cloud providers (Azure, AWS, or GCP)

  • Experience writing applications using C#, Python, or Java

  • Experience with scripting (PowerShell, Bash, or Python)

  • Excellent verbal and written communication skills

  • Requires practical knowledge of Site Reliability Engineering obtained through advanced education combined with experience

  • Requires a University Degree or equivalent experience and minimum 4 years of prior relevant experience; or an advanced degree without experience

Preferred Qualifications:

  • Experience with container technologies: Kubernetes, Docker

  • Experience with logging platforms and application performance metrics - NewRelic, Splunk, Application Insights

Apply Now

Not You?

You are now being redirected to complete your application