Current jobs related to Large Scale Infrastructure Specialist - Singapore - beBeeSoftwareEngineer


  • Singapore beBeeDistributedTraining Full time $125,000 - $175,000

    Distributed Training & Inference Optimization SpecialistWe are looking for a skilled specialist to maximize the performance and efficiency of large-scale training and inference workloads on our GPU clusters.Key Responsibilities:Optimize LLM training frameworks: Maximize GPU utilization and reduce training time using PyTorch, DeepSpeed, Megatron-LM, and...


  • Singapore beBeeInfrastructure Full time $80,000 - $120,000

    Software Infrastructure SpecialistThe MRS ML Infra team focuses on optimizing performance and efficiency for AI training and inference workflows in large-scale recommendation domains. In this role, you will work on enhancing the e2e stack for model training and inference for large-scale recommendation models.This involves identifying and executing short- and...


  • Singapore beBeeProject Full time $90,000 - $120,000

    Project Manager Job OpportunitySeeking an experienced project manager to lead infrastructure and automation projects. Key responsibilities include managing resources, risks, and change.The ideal candidate will have strong leadership skills, strategic thinking, and excellent communication skills.Minimum 10 years of experience in IT and engineering projects,...


  • Singapore beBeeCivil Full time $180,000 - $250,000

    We are seeking an experienced construction professional to join our team as a Senior Project Manager.Job Summary:This role will involve overseeing the execution of large-scale infrastructure projects, ensuring timely completion, and managing budgets. The ideal candidate will have a strong background in civil engineering and at least 10 years of experience in...


  • Singapore beBeeInfrastructure Full time $120,000 - $180,000

    About this role: Job Description:This is a high-level, executive position within our organization responsible for overseeing the operations of large-scale data centers. The successful candidate will have extensive experience in managing complex IT infrastructure and leading cross-functional teams.Key Responsibilities:Manage day-to-day operations in our data...


  • Singapore beBeeData Full time $120,000 - $180,000

    We are seeking a skilled Big Data Engineer to join our E-commerce Recommendation Infrastructure team.Job DescriptionThis role involves designing and implementing large-scale recommendation systems, ensuring high-performance storage and computing systems, and troubleshooting production issues. You will work closely with applied machine learning engineers to...


  • Singapore beBeeBim Full time $80,000 - $120,000

    Are you an expert in Building Information Modelling (BIM) looking to take your skills to the next level? We are seeking a highly skilled BIM Coordinator to join our team working on large scale infrastructure projects.About the RoleThis is an exciting opportunity for a motivated and experienced BIM Coordinator to provide technical support to our design teams,...


  • Singapore beBeeReliability Full time $125,000 - $175,000

    Job TitleA senior site reliability engineer is needed to ensure the smooth operation of large-scale distributed systems.Design, deploy, and manage CI/CD pipelines to deliver software consistently.Administer, scale, and optimize Kubernetes deployments for high availability and fault tolerance.Architect and maintain microservices infrastructure for seamless...


  • Singapore beBeeNetwork Full time $80,000 - $120,000

    Project Manager for Large-Scale IntegrationThis is a senior-level role responsible for leading large-scale integration projects from initiation to completion.Technical Project Leadership: Direct and manage cross-functional teams comprising engineers, designers, and other technical specialists.Integration Strategy Development: Collaborate with stakeholders to...


  • Singapore beBeeData Full time $80,000 - $120,000

    Software Engineering Opportunity">This role is responsible for crafting and implementing a storage solution for offline data in the recommendation system, catering to over a billion users.The primary objectives are to guarantee system reliability, uninterrupted service, and seamless performance. The team aims to create a storage and computing infrastructure...

Large Scale Infrastructure Specialist

2 weeks ago


Singapore beBeeSoftwareEngineer Full time $80,000 - $120,000
Infrastructure Software Engineer

This role is a key part of our large-scale AI training and inference infrastructure, focusing on performance and efficiency in the recommendation domain.

You will be optimizing the end-to-end stack for model training and inference, working on distributed systems, model/system co-design, GPU optimizations, and more.

Your main responsibilities include identifying and leading efforts to improve efficiency, as well as shaping long-term strategies such as performance automation and regression mitigation.

We are looking for an experienced professional with 6+ years programming experience in relevant languages, and 6+ years building large-scale infrastructure applications. You should have experience in system efficiency, scalability, and stability, ownership of system components, and proficiency in scripting languages like Python, JavaScript, or Hack.

We also look for a track record of high-quality, reliable work, experience leading technical teams, and cross-team collaboration, knowledge of quality assurance practices, and a Bachelor's degree in a relevant technical field or equivalent experience.

Besides these required skills and qualifications, we also welcome knowledge of large-scale software architecture, experience with C, C++, Java, hands-on experience with large-scale AI infrastructure systems, experience with training/inference for large models, expertise in high-performance computing and GPU optimization.

{ul>Identify performance bottlenecks across models, infrastructure, and systemsImplement improvements to enhance efficiencyGuide other engineers on performance opportunitiesCoordinate with cross-functional ML teamsDefine technical strategies and roadmapsMentor team members