Current jobs related to Large Scale Infrastructure Specialist - Singapore - beBeeSoftwareEngineer
-
Large Scale Distributed Training Specialist
2 weeks ago
Singapore beBeeDistributedTraining Full time $125,000 - $175,000Distributed Training & Inference Optimization SpecialistWe are looking for a skilled specialist to maximize the performance and efficiency of large-scale training and inference workloads on our GPU clusters.Key Responsibilities:Optimize LLM training frameworks: Maximize GPU utilization and reduce training time using PyTorch, DeepSpeed, Megatron-LM, and...
-
Optimizing Large-Scale Recommendation Systems
2 weeks ago
Singapore beBeeInfrastructure Full time $80,000 - $120,000Software Infrastructure SpecialistThe MRS ML Infra team focuses on optimizing performance and efficiency for AI training and inference workflows in large-scale recommendation domains. In this role, you will work on enhancing the e2e stack for model training and inference for large-scale recommendation models.This involves identifying and executing short- and...
-
Singapore beBeeProject Full time $90,000 - $120,000Project Manager Job OpportunitySeeking an experienced project manager to lead infrastructure and automation projects. Key responsibilities include managing resources, risks, and change.The ideal candidate will have strong leadership skills, strategic thinking, and excellent communication skills.Minimum 10 years of experience in IT and engineering projects,...
-
Senior Project Manager
2 weeks ago
Singapore beBeeCivil Full time $180,000 - $250,000We are seeking an experienced construction professional to join our team as a Senior Project Manager.Job Summary:This role will involve overseeing the execution of large-scale infrastructure projects, ensuring timely completion, and managing budgets. The ideal candidate will have a strong background in civil engineering and at least 10 years of experience in...
-
Singapore beBeeInfrastructure Full time $120,000 - $180,000About this role: Job Description:This is a high-level, executive position within our organization responsible for overseeing the operations of large-scale data centers. The successful candidate will have extensive experience in managing complex IT infrastructure and leading cross-functional teams.Key Responsibilities:Manage day-to-day operations in our data...
-
Large Scale Data Architect
1 week ago
Singapore beBeeData Full time $120,000 - $180,000We are seeking a skilled Big Data Engineer to join our E-commerce Recommendation Infrastructure team.Job DescriptionThis role involves designing and implementing large-scale recommendation systems, ensuring high-performance storage and computing systems, and troubleshooting production issues. You will work closely with applied machine learning engineers to...
-
Singapore beBeeBim Full time $80,000 - $120,000Are you an expert in Building Information Modelling (BIM) looking to take your skills to the next level? We are seeking a highly skilled BIM Coordinator to join our team working on large scale infrastructure projects.About the RoleThis is an exciting opportunity for a motivated and experienced BIM Coordinator to provide technical support to our design teams,...
-
Large Scale Distributed Systems Engineer
1 week ago
Singapore beBeeReliability Full time $125,000 - $175,000Job TitleA senior site reliability engineer is needed to ensure the smooth operation of large-scale distributed systems.Design, deploy, and manage CI/CD pipelines to deliver software consistently.Administer, scale, and optimize Kubernetes deployments for high availability and fault tolerance.Architect and maintain microservices infrastructure for seamless...
-
Senior Large Scale Integration Project Lead
2 weeks ago
Singapore beBeeNetwork Full time $80,000 - $120,000Project Manager for Large-Scale IntegrationThis is a senior-level role responsible for leading large-scale integration projects from initiation to completion.Technical Project Leadership: Direct and manage cross-functional teams comprising engineers, designers, and other technical specialists.Integration Strategy Development: Collaborate with stakeholders to...
-
Large Scale Data Storage Engineer
2 weeks ago
Singapore beBeeData Full time $80,000 - $120,000Software Engineering Opportunity">This role is responsible for crafting and implementing a storage solution for offline data in the recommendation system, catering to over a billion users.The primary objectives are to guarantee system reliability, uninterrupted service, and seamless performance. The team aims to create a storage and computing infrastructure...

Large Scale Infrastructure Specialist
2 weeks ago
This role is a key part of our large-scale AI training and inference infrastructure, focusing on performance and efficiency in the recommendation domain.
You will be optimizing the end-to-end stack for model training and inference, working on distributed systems, model/system co-design, GPU optimizations, and more.
Your main responsibilities include identifying and leading efforts to improve efficiency, as well as shaping long-term strategies such as performance automation and regression mitigation.
We are looking for an experienced professional with 6+ years programming experience in relevant languages, and 6+ years building large-scale infrastructure applications. You should have experience in system efficiency, scalability, and stability, ownership of system components, and proficiency in scripting languages like Python, JavaScript, or Hack.
We also look for a track record of high-quality, reliable work, experience leading technical teams, and cross-team collaboration, knowledge of quality assurance practices, and a Bachelor's degree in a relevant technical field or equivalent experience.
Besides these required skills and qualifications, we also welcome knowledge of large-scale software architecture, experience with C, C++, Java, hands-on experience with large-scale AI infrastructure systems, experience with training/inference for large models, expertise in high-performance computing and GPU optimization.
{ul>Identify performance bottlenecks across models, infrastructure, and systemsImplement improvements to enhance efficiencyGuide other engineers on performance opportunitiesCoordinate with cross-functional ML teamsDefine technical strategies and roadmapsMentor team members