Site Reliability Engineer

Street Contxt

Street Contxt

Software Engineering
Toronto, ON, Canada
Posted on Nov 6, 2024
Are you a Site Reliability Engineer that has a passion for building reliable, resilient and performant systems that scale? Do you command with a steady hand when incidents unfold? Are you motivated by team success? If so, continue reading…
We are on a mission to build and strengthen our engineering teams to match the accelerating success of Street Context. We provide a premium Email, Analytics and Broker Relationship platform, purpose-built for capital markets and institutional investors. Street Context has charted a course to:
Scale our system to meet increasing global demand
Meet regulatory compliance requirements in new regional markets
Productionalize and roll-out new market-validated features

I am seeking an experienced Site Reliability Engineer. I encourage you to apply if you possess:

  • Strong SRE and AWS Experience: Collaborated with development teams to enhance reliability, availability, and performance in AWS, managing both monolithic and microservices architectures.
  • Incident Response Leadership: Managed critical incidents, root cause analysis, and postmortem processes for continuous improvement.
  • Observability Expertise: Used Datadog and similar tools for proactive monitoring and incident response.
  • Change Management and CI/CD Expertise: Experience with formal Change Management process. Designed CI/CD pipelines for reliable deployments, balancing speed and stability.
  • Capacity Planning and Performance Tuning Expertise: Conducted load testing and fine-tuned systems for high availability.
  • Solid Grasp of SLOs, SLIs, SLAs, and DORA Metrics: Defined and tracked performance metrics, proactively aligning reliability work with business goals.
  • Programming Skills: Proficient in Java, JavaScript, Python, and shell scripting for automation and tooling.
  • Security-First Approach: Ensured secure configurations, following least-privilege principles.
  • Drive to Empower Teams and Reducing Toil: Mentored and equipped teams to independently enhance their applications with reliability best practices.
  • Team Integration and Trust: Contributed value as a dependable, collaborative team member.
  • Adaptability: Thrived in dynamic environments with diverse responsibilities.

For this role, you must have the following:

  • 5+ years of recent SRE experience
  • 3+ years of recent comprehensive experience with Datadog
  • 5+ years of AWS experience, leveraging core AWS services and exposure to event-driven and highly available architectures.
  • 10+ years in IT, including experience with virtualization, Linux, middleware, and various technology stacks.
Finally, in-office collaboration is integral to our team dynamic and company culture. We seek candidates who will enthusiastically embrace this aspect of our work environment, valuing the opportunities it presents for team bonding, collaboration, and innovation.
I am personally looking forward to connecting with you! Apply now to continue our conversation and learn how you can become part of the Street Context team.
Take-care,
Steve Dodd
Principle Engineer @ Street Context