
Site Reliability Engineer
2 weeks ago
Responsibilities
About the Team The Data Management Suite team is building products that cover the whole lifecycle of data pipeline, including data ingestion and integration, data development, data catalog, data security and data governance. These products support various businesses, so data engineers and data scientists could greatly boost their productivity.
As a software engineer in the data management suite team, you will have the opportunity to build, optimize and grow one of the largest data platforms in the world. You\'ll have the opportunity to gain hands-on experience on core systems in the data platform ecosystem. Your work will have a direct and huge impact on the company\'s core products as well as hundreds of millions of users.
Be responsible for the production stability for big data development and governance systems.
Engage in and improve the whole lifecycle of service, from inception and design, through to deployment, operation and refinement.
Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Practice sustainable incident response and blameless postmortems.
Establish best engineering practice for engineers as well as non-technical people.
Design and implement reliable, scalable, robust and extensible big data systems that support core products and business.
Qualifications
Minimum Qualifications
Bachelor\'s degree in Computer Science, a related technical field involving software or systems engineering, or equivalent practical experience.
Experience with site reliability engineering, monitoring, alerting for big data related systems.
Experience writing code in Java, Go, Python or a similar language.
Preferred Qualifications
Knowledge about a variety of strategies for ingesting, modeling, processing, and persisting data, ETL design, job scheduling and dimensional modeling.
Familiarity with running production grade services at scale and understanding cloud native technologies and networking.
Experience developing tools and APIs to reduce human interaction with systems and applications using a variety of coding and scripting standards.
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems is a plus (Hadoop, M/R, Hive, Spark, Presto, Flume, Kafka, ClickHouse, Flink or comparable solutions).
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
#J-18808-Ljbffr
-
Site Reliability Engineer
2 weeks ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
1 week ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
1 week ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. Purpose This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
2 weeks ago
Singapore beBeeSiteReliability Full timeUnlock Your Full Potential in Site Reliability Engineering About the Role This is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth. Develop and implement proactive, predictive models for shift production management using SRE...
-
Site Reliability Engineer
1 week ago
Singapore HCLTech Full timeGet AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...
-
Site Reliability Engineer
1 week ago
Singapore HCLTech Full timeGet AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...
-
Site Reliability Engineer
1 week ago
Singapore DHATCH CONSULTANCY PTE. LTD. Full timeSite Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...
-
Site Reliability Engineer
2 weeks ago
North-East Singapore PERSOLKELLY Full timeThe Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....
-
Site Reliability Engineer Assistant
3 weeks ago
Singapore Manpower Singapore Full timeSite Reliability Engineer Assistant (DevOps) Site Reliability Engineer Assistant (DevOps) This range is provided by Manpower Singapore. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range Responsible for the operation and maintenance of online game marketing services, to ensure the continuous...
-
Site Reliability Engineer
4 weeks ago
Singapore RigNet Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...