Job Description
Requirements
• We welcome candidates at various experience levels for this role,
• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience),
• Proven experience as a DevOps Engineer or similar role, with a focus on automation and software tooling,
• Strong knowledge of CI/CD principles and experience with build and release management tools like GitHub and/or GitLab,
• Proficient in scripting and programming languages such as Python, Bash, or Go, with the ability to write clean, efficient, and reusable code,
• Experience with configuration management tools like Ansible,
• Familiarity with containerization technologies like Docker and container orchestration platforms like Kubernetes,
• Solid understanding of networking concepts, including TCP/IP, load balancing, and DNS,
• Strong problem-solving skills with a proactive and analytical mindset,
• Excellent communication and collaboration skills, with the ability to work effectively in a... team-oriented environment,
• Experience with observability tools such as Prometheus, Grafana, or the ELK stack,
• Understanding of security best practices and experience with securing Linux infrastructure and applications,
• Familiarity with Agile/Scrum methodologies and working in an Agile development environment
What the job involves
• As a Tenstorrent DevOps Engineer, you will play a critical role in building and maintaining our robust and scalable development and hybrid cloud/on-prem testing infrastructure,
• You will work closely with cross-functional teams, including software engineers, system administrators, and QA, to ensure the availability, reliability, and performance of our development and testing environments,
• A significant part of your work will focus on the automation platform for our on-premises colocation data centers,
• Design, implement, and maintain automation tools and frameworks to streamline software delivery and infrastructure management processes,
• Collaborate with development teams to ensure reliable and efficient deployment pipelines, continuous integration, and delivery workflows,
• Develop and maintain monitoring and alerting systems to proactively identify and address performance bottlenecks, availability issues, and other infrastructure-related problems,
• Create and maintain infrastructure-as-code (IaC) templates using tools like Terraform and Ansible,
• Implement and manage containerization technologies such as Docker and orchestration platforms like Kubernetes for scalable and resilient deployments,
• Collaborate with security teams to implement and enforce best practices for securing infrastructure and applications,
• Participate in incident response and troubleshooting activities to identify root causes and implement preventive measures,
• Isolate on-prem hardware failures and identify their root causes,
• Continuously research and evaluate emerging technologies, tools, and methodologies to drive innovation and improve operational efficiency