SRE Implementation
1 week ago
Do you love a career where you Experience, Grow & Contribute at the same time, while earning at least 10% above the market? If so, we are excited to have bumped into you.
Learn how we are redefining the meaning of work, and be a part of the team raved about by Clients, Job-seekers and Employees.
If you are an SRE Implementation specialist looking for excitement, challenge and stability in your work, then you will be glad to have come across this page.
We are an IT Solutions Integrator/Consulting Firm helping our clients hire the right professional for an exciting long-term project. Here are a few details.
Check if you are up for maximizing your earning/growth potential, leveraging our Disruptive Talent Solution.
Role: SRE Implementation
Location: Singapore
We are looking for a highly skilled and experienced SRE (Site Reliability Engineer) Implementation specialist to join our team. As an SRE Implementation professional, you will be responsible for designing, building, and maintaining robust and scalable infrastructure solutions to ensure the reliability and performance of our systems. Your expertise in Azure Ops, Terraform, Datadog, and SRE practices will be instrumental in enhancing the stability and efficiency of our technology stack.
**Responsibilities**:
Infrastructure Design and Implementation: Collaborate with development and operations teams to design and implement reliable and scalable infrastructure solutions on the Azure platform. Utilize your expertise in Azure Ops and Terraform to provision and configure cloud resources effectively.
Monitoring and Alerting: Implement monitoring and alerting systems using Datadog to proactively identify performance issues, bottlenecks, and system anomalies. Configure and maintain monitoring dashboards to provide real-time visibility into system health and performance.
Incident Management: Take a lead role in incident management and resolution. Respond to and resolve critical incidents impacting system availability and performance promptly. Conduct post-incident reviews and implement measures to prevent recurrence.
Automation and CI/CD: Champion automation initiatives to streamline repetitive tasks and improve overall efficiency. Implement and maintain CI/CD pipelines to automate the deployment and configuration of infrastructure components.
Capacity Planning and Scalability: Collaborate with stakeholders to forecast capacity requirements and plan for scalability. Implement strategies to ensure our systems can handle increased loads and maintain high availability.
Security and Compliance: Ensure the security and compliance of the infrastructure by implementing best practices and adhering to industry standards. Collaborate with the security team to address vulnerabilities and implement security measures.
Documentation and Knowledge Sharing: Maintain comprehensive documentation of infrastructure, configurations, and processes. Share knowledge with team members to promote a culture of learning and collaboration.
**Requirements**:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum 6 years of hands-on experience in SRE, system administration, or related roles.
- Strong expertise in Azure Ops and proven experience with Terraform for infrastructure provisioning and management.
- Proficiency in Datadog or similar monitoring and analytics platforms for system monitoring and alerting.
- Solid understanding of SRE principles and best practices, including incident management, performance optimization, and automation.
- Experience with CI/CD tools and practices for automated deployments and configuration management.
- Knowledge of cloud security principles and best practices.
- Strong analytical and problem-solving skills with the ability to troubleshoot complex issues effectively.
- Excellent communication and collaboration skills to work effectively in a team-oriented environment.
- Proven track record of managing and maintaining highly available and scalable production systems.
Azure Ops + Terraform + Datadog + SRE
-
Engineer, SRE
2 weeks ago
Singapore Rakuten Full time
Job Description: Rakuten International oversees 7 businesses with over 4,000 employees globally. The brand is recognized for its leadership and innovation in e-commerce, digital content, advertising, entertainment and communications, bringing the joy of discovery and access to more than 1 billion members across the world. Our teams deliver on the...
-
SRE Engineer
1 week ago
Singapore ZENITH INFOTECH (S) PTE LTD. Full time
Performance Monitoring
- Collaborate with the SRE team to maintain and improve the reliability and performance of our production systems.
- Assist in the design, implementation, and deployment of scalable and automated infrastructure solutions.
- Utilize Python programming skills to develop tools, scripts, and automation frameworks to enhance system...
-
VP, Platform SRE Engineer, SRE
3 days ago
Singapore DBS Bank Full time
Job Objective: DBS Bank is looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and performing continuous improvements to increase the platform’s efficiency and resiliency. The SRE...
-
SRE Lead
2 weeks ago
Singapore PINPOINT ASIA Full time
Our client is a leading web3 firm that offers a cutting-edge, user-friendly solution that combines industry-leading security features with a powerful, intuitive interface in today's fast-paced digital economy, managing your cryptocurrency assets with security and ease. Their platform and wallet empower you to store, send, and receive a wide range of digital...
-
Site Reliability Engineer
3 days ago
Singapore DADACONSULTANTS PTE. LTD. Full time
Site Reliability Engineer (SRE) Responsibilities
- Assist in deploying and managing microservices on Kubernetes cloud platforms.
- Work with Cloud and DevOps teams to deploy services across multiple cloud providers (AWS, OCI, Azure, GCP).
- Conduct load and chaos testing to ensure system scalability and reliability.
- Support disaster recovery planning and...
-
Singapore YEPEESOFT PTE. LTD. Full time
Key Responsibilities
- Design, implement, and maintain CI/CD pipelines to enable fast, reliable releases
- Automate infrastructure provisioning using Infrastructure as Code (IaC) tools
- Manage and optimise cloud infrastructure (AWS / Azure / GCP)
- Deploy, scale, and manage containerised applications using Docker & Kubernetes
- Monitor system health, availability,...
-
Singapore DBS Bank Full time
The Role: We are looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and performing continuous improvements to increase the platform's efficiency and resiliency. The SRE engineer...
-
SRE Developer
6 hours ago
Singapore Selini Capital Full time
Job Specification: Site Reliability Engineer (SRE) – Crypto High-Frequency Trading
Overview
We are seeking a Site Reliability Engineer (SRE) to design and build production configuration and deployment tools for our high-frequency trading (HFT) platform. This role is critical in ensuring the stability, scalability, and automation of our infrastructure. The...
-
Singapore DBS Bank Limited Full time
The Role: We are looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and performing continuous improvements to increase the platform's efficiency and resiliency. The SRE engineer...
-
Senior SRE Architect, Security Engineering
2 weeks ago
Singapore ByteDance Full time
Responsibilities
About the Company
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...