NewYorkRecruiter Since 2001
the smart solution for New York jobs

Site Reliability Engineer

Company: Recruiting From Scratch
Location: New York
Posted on: March 26, 2025

Job Description:

Who is :Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.DevOps Engineer - Cutting-Edge Financial AI StartupTitle: DevOps EngineerLocation: New York CityCompany Stage of Funding: Series AOffice Type: Onsite (Beautiful NYC Office)Salary: $140,000 - $180,000 DOECompany DescriptionOur client is building revolutionary AI thought partners that are transforming how knowledge is created and shared in financial services. They're an unabashedly ambitious team with exceptional traction among the world's largest investment banks, hedge funds, and private equity firms. With a lean, smart, and enormously ambitious team working from their beautiful NYC office, they're on track to become the biggest Financial AI company in the world.This isn't just another tech startup - they're pushing the boundaries of published research, redefining what's possible with AI, and literally inventing the future of financial knowledge work. Their platform is crazily powerful, creating tools that make people smarter and revolutionizing how knowledge is discovered, created, and shared.What You Will Do

  • Infrastructure Magic: Design, deploy, and maintain cloud infrastructure on AWS and/or Azure that powers cutting-edge AI systems, ensuring high availability and resilience.
  • Performance Guardian: Implement and manage sophisticated monitoring solutions using Datadog to proactively identify and address system issues before they impact users.
  • Kubernetes Commander: Manage production Kubernetes clusters and utilize Helm for package management and deployment automation.
  • Automation Wizard: Develop and maintain Infrastructure as Code (IaC) using Terraform and create automation scripts in Bash or Python to streamline operations.
  • Collaboration Champion: Work closely with development and operations teams to foster DevOps culture, share best practices, and ensure seamless integration and deployment processes.
  • Problem Solver: Troubleshoot and resolve complex cross-platform issues related to OS, networking, and databases in a cloud-based environment.
  • Knowledge Keeper: Maintain comprehensive documentation of system configurations, procedures, and troubleshooting guides.Ideal Candidate Background
    • Bachelor's degree in Computer Science, Information Technology, or related field.
    • 3-5 years of hands-on experience with AWS and/or Azure cloud platforms, including services like EC2, S3, VPC, and Lambda.
    • 2-3 years of experience managing Kubernetes clusters in production environments.
    • 2-3 years of experience with Helm for Kubernetes package management.
    • 2-3 years of experience with Datadog or similar monitoring tools.
    • 3-5 years of experience with Linux system administration and shell scripting.
    • 2-3 years of experience with Infrastructure as Code (IaC) tools like Terraform.
    • Proficiency in scripting languages such as Bash and Python.
    • Strong understanding of networking fundamentals, including TCP/IP, DNS, and load balancing.
    • Experience with CI/CD pipelines and tools like Jenkins, GitLab CI, or GitHub Actions.
    • Knowledge of cloud-native security best practices and compliance frameworks.
    • Excellent problem-solving skills and ability to navigate complex challenges effectively.
    • Strong communication and collaboration skills.Preferred Qualifications
      • Experience with MLOps monitoring and observability.
      • Experience with PostgreSQL, Elasticsearch, and vector databases such as Qdrant or similar technologies.
      • Experience with monitoring and security tools such as Datadog, AWS GuardDuty, CloudWatch, and CloudTrail.
      • Certifications in AWS, Azure, or Kubernetes.
      • Experience with other cloud platforms like Google Cloud Platform (GCP).
      • Experience with distributed tracing and observability tools.About You
        • You thrive in fast-paced environments and are excited to work at a high-growth startup.
        • You're ambitious and enjoy solving problems that others think are impossible.
        • You're curious and find joy in learning about AI, technology, and finance.
        • You're autonomous, self-directed, and comfortable working with ambiguity.
        • You're collaborative, organized, and thoughtful in your approach.Compensation & Benefits
          • Competitive salary range: $140,000 - $180,000 (depending on experience).
          • Comprehensive health, dental, and vision insurance.
          • Generous vacation policy.
          • Equity options.
          • Professional development opportunities.
          • Work with a world-class team on frontier technology.
          • Beautiful office in the heart of NYC.
            #J-18808-Ljbffr

Keywords: Recruiting From Scratch, New York , Site Reliability Engineer, Engineering , New York, New York

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category
within


Log In or Create An Account

Get the latest New York jobs by following @recnetNY on Twitter!

New York RSS job feeds