Client Job Description
Site Reliability Engineer (Both AVP [4-7 y.o.e] and VP [8+ y.o.e.] level openings are available at this time)
Our growing FinTech client provides access to alternative investments like private equity and hedge funds. They essentially connect investors with opportunities to diversify their portfolios beyond traditional stocks and bonds. Through their platform, investors can explore and invest in a range of alternative assets, potentially offering higher returns or lower correlation with traditional markets.
Responsibilities
- Build highly available solutions across the entire SDLC stack with primary focus on an internet facing fintech site.
- Develop and maintain tools to support the development environment on MacOS and Linux tool environment with focus on improving developer productivity.
- Maintain site reliability with a focus on building highly scalable systems, integrating resiliency and high availability at all levels.
- Develop software and tooling to secure and automate cloud infrastructure building software delivery capabilities with fully automatic workflows.
- Design and operation of a Kubernetes environment for container management and orchestration.
- Participate in on-call rotations to help understand the system while helping build tools for automation.
Qualifications
- DevOps, TechOps, or SRE experience with AWS.
- Microservices (Docker, Kubernetes) experience in a production environment strongly desired
- Strong Linux OS-level and command-line/scripting knowledge and configuration management principles
- Working knowledge of databases such as MongoDB, Postgres, DynamoDB
- Experience in architecting, implementing, and managing monitoring tools such as Prometheus/Grafana, CloudWatch, Splunk, NewRelic and ELK in the cloud
- Coding beyond simple scripting with strong opinions on maintainable/reusable code in Python, Ruby, or Java desired
- Experience with computer provisioning on a Cloud based platform using Terraform and/or Cloud formation
- Experience with distributed systems design, maintenance, and hands-on troubleshooting/debugging skills
- Exceptional analytical skills, able to apply knowledge and experience in decision-making to arrive at creative and commercial solutions
- Experience building a Microservice based architecture
- Excellent written and verbal communication skills
- Experience in updating runbooks, tools, and documentation that help the team to respond to incidents proactively
- Able to design and implement complex, but easily managed, automated infrastructure
- A desire to share, teach, and learn as part of a team
- AWS certifications are a plus