SRE Senior Engineer

Location: Union, New Jersey
Date Posted: 06-12-2018
The Site Reliability Team at our client is looking for Senior Site Reliability Engineers (SRE’s) who want to build highly scalable and stable systems. Our Site Reliability team has access to a production class performance testing lab where we build out and test our designs using the latest technologies. Our team blends a variety of skill sets and works collaboratively to ensure not only that we deliver quality releases, but also take an active role in determining what architectures and technologies perform, scale and deliver services reliably. If you're looking for an opportunity to deepen your expertise and learn new things, this job is for you. 

Responsibilities
  • Engage at all levels across the organization 
  • Design, build, test, and size systems
  • Automating monitoring, alerting, and escalation
  • Troubleshoot issues across the entire stack - hardware, software, applications and network
  • Define and validate current and future configuration processes and policies 
  • Take part in a 24x7 on-call rotation

Qualifications 
  • 8+ years of experience as an SRE 
  • Experience responding to catastrophic Sev 1 events and being an active participant in resolution 
  • Hands on experience building fault tolerant infrastructure containing Kubernetes, Kafka, Cassandra at scale. You know what the pitfalls are and build/test to prevent them 
  • Hands on experience building and using monitoring tools such as CA, Nagios, InfluxDB, Grafana, Prometheus, Stack Driver, etc.
  • AWS / Google Cloud experience is required 
  • Hands on experience using one of the following configuration management tools such as Puppet, Ansible, Salt, Chef, or CFEngine 
  • Familiarity with log analysis tools such as SumoLogic, ELK, and Splunk 
  • Knowledge of RUM tools and APM tools such as NewRelic, Webpagetest, mPulse, boomerang.js, etc. 
  • Demonstrable knowledge of TCP/IP, HTTP, security, and storage
  • Practical knowledge of shell scripting and at least one scripting language (Python, Ruby)
  • Detail oriented, passionate, even obsessive 
  • Most importantly, you must be highly intellectually curious, independent, but still a team player
or
this job portal is powered by CATS