
Senior Machine Learning Engineer
2 days ago
**Who Are We Looking For**:
This role requires someone familiar with the **dynamic nature of a startup**, capable of rapidly designing and implementing scalable solutions. You'll work closely with research teams to optimize performance and ensure seamless integration of systems, handling data from **financial institutions, government agencies, consumer brands, and internet companies**.
**Key Responsibilities:Strong understanding of ML concepts and algorithms**:
Practical experience working with models in production settings in AI / data science teams to transform AI / data science code into scalable, production-ready systems.
**Data Ingestion & Integration**:
- Ingest data from **enterprise relational databases** such as **Oracle**, **SQL Server**, **PostgreSQL**, and **MySQL**, as well as **enterprise SQL-based data warehouses** like **Snowflake**, **BigQuery**, **Redshift**, **Azure Synapse**, and **Teradata** for large-scale analytics.
**Data Validation & Quality Assurance**:
- Ensure ingested data conforms to predefined **schemas**, checking data types, missing values, and field constraints.
- Implement **data quality checks** for nulls, outliers, and duplicates to ensure data reliability.
**Data Transformation & Processing**:
- **Design scalable data pipelines** for **batch processing**, deciding between **distributed computing** tools like **Spark**, **Dask**, or **Ray** when handling extremely large datasets across multiple nodes, and **single-node tools** like **Polars** and **DuckDB** for more lightweight, efficient operations. The choice will depend on the size of the data, system resources, and performance requirements.
- **Leverage Polars** for high-speed, in-memory data manipulation when working with large datasets that can be processed efficiently in-memory on a single node.
- **Utilize DuckDB** for on-disk query execution, offering SQL-like operations with mínimal overhead, suitable for environments that need a balance between memory use and query performance.
- Seamlessly transform **Pandas-based research code** into **production-ready pipelines**, ensuring efficient memory usage and fast data access without adding unnecessary complexity.
**Data Storage & Retrieval**:
- Work with internal data representations such as **Parquet**, **Arrow**, and **CSV** to support the needs of our generative models, choosing the appropriate format based on **data processing and performance needs**.
**Distributed Systems & Scalability**:
- Ensure that the system can **scale efficiently from a single node to multiple nodes**, providing **graceful scaling** for users with varying compute capacities.
- Optimize **SQL-based queries** for performance and scalability in **enterprise SQL environments**, ensuring efficient querying across large datasets.
**GPU Acceleration & Parallel Processing**:
- Utilize **GPU acceleration** and **parallel processing** to improve performance in large-scale model training and data processing.
**Data Lineage & Metadata Management** (Reduced Emphasis):
- Implement **basic data lineage** for auditability, ensuring traceability in data transformations when required.
- Manage metadata as needed to document pipelines and workflows.
**Error Handling, Recovery, & Performance Monitoring**:
- Design robust **error handling** mechanisms, with **automatic retries** and **data recovery** in case of pipeline failures.
- Track performance metrics such as **data throughput**, **latency**, and **processing times** to ensure efficient pipeline operations at scale.
**Documentation & Reporting**:
- Create clear **documentation** of data pipelines, workflows, and system architectures to enable smooth handovers and collaboration across teams.
**Essential Skills and Qualifications**:High Priority**:
- Hands-on experience **scaling data pipelines** and **machine learning systems** to handle **hundreds of millions to billions of rows** in enterprise environments.
- 4+ years of experience in building scalable data solutions with **Python** and distinct libraries such as:
- **Data Science Libraries**:Pandas**, **NumPy**, **Scikit-learn**.
- **Scaling Libraries**:Polars** for in-memory processing and **DuckDB** for efficient on-disk queries.
- Ability to **choose the right framework** (e.g., **Dask**, **Ray**, **Polars**, **DuckDB**) depending on the workload and environment, with a focus on balancing simplicity and scalability.
- Experience in **data validation** and ensuring data quality with tools like **Pandera** or **Pydantic**.
- Proficiency in building **ETL/ELT pipelines** and managing data across **relational databases**, **data warehouses**, and **cloud storage**.
- Strong knowledge of **GPU parallelization** for deep learning models using **PyTorch**.
**Good to Have**:
- Experience with logging and monitoring in production environments.
- Understanding of **data lineage** and **metadata management** systems to support data transparency.
- Familiarity with **Pytest** for test
-
Machine Learning Manager
4 days ago
Remote, Singapore Bjak Full time**About Us** We leverage cutting-edge technology, including Custom APIs, trading systems, and data science, to simplify financial services that were once complex and inaccessible. Our ability to navigate complex regulations has enabled us to pioneer groundbreaking products, such as offering investment-linked life and health insurance online with instant...
-
Algorithm/ Machine Learning Engineer
2 days ago
Remote, Singapore Binance Full timeBinance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 250 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance...
-
Lead Ai Engineer
4 days ago
Remote, Singapore Bjak Full time**About Us** Our core strengths lie in navigating complex regulations and environments, creating some of the most innovative products in the world. For instance, we are the first platform globally to simplify and offer investment-linked life and health insurance online, coupled with an instant talk-to-agent service. If you enjoy building cutting-edge...
-
Head of Ai Engineer
4 days ago
Remote, Singapore Bjak Full time**About us** At **Bjak**, we're on a mission to make financial services more **affordable and accessible** across ASEAN. Headquartered in **Malaysia**, we are **Southeast Asia’s largest insurance portal**, helping millions find the best value and coverage for their insurance needs. Our strength lies in **cutting-edge technology**, including **Custom...
-
Senior Software Engineer
2 days ago
Remote, Singapore S&P Global Full time**Senior Software Engineer**: - Virtual, Poland - Information Technology - 314286 **Job Description**: **About The Role**: **Grade Level (for internal use)**: 10 ** The Team**: S&P Global is a global market leader in providing information, analytics and solutions for industries and markets that drive economies worldwide. The Market Intelligence (MI)...
-
Senior Manager, Solutions Engineering
2 days ago
Remote, Singapore Mambu Full time**Who we are**: Join the fintech revolution with Mambu, the leading SaaS cloud banking platform. We're on a mission to make banking better for a billion people. Explore exciting career opportunities and help shape the future of financial services. As the **Solutions Engineering/ Presales Manager**for the **APAC**region, you will play a pivotal role in...
-
Senior Backend/devops Engineer
4 days ago
Remote, Singapore Oppizi Full time**Job type** - Full-time **Seniority level**: - Senior **Work type** - Remote position **Schedule** - Monday to Friday, fixed hours (09:00 - 18:00) **Expected start date** - ASAP **Team members** - 12 Software Engineers, 3 AQA Engineers, 6 MQA Engineers, QA Lead, 2 UI/UX Designers, 3 Product Managers, Head of Product, and CTO **Team structure** - the...
-
Senior Frontend Engineer
1 week ago
Remote, Singapore JENNI Full time $120,000 - $180,000 per yearThis full-time remote role is for a Senior Frontend Engineer. The Senior Frontend Engineer will be responsible for developing user-facing features, ensuring the technical feasibility of UI/UX designs, optimizing applications for maximum speed and scalability, and collaborating with other team members and stakeholders to create a cohesive product. Daily tasks...
-
Senior Java Engineer
3 days ago
Remote, Singapore Binance Full time $120,000 - $200,000 per yearBinance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance...
-
Senior Product and Solutions Sales Manager
2 days ago
Remote, Singapore Dell Technologies Full timeSenior Product and Solutions Sales Manager (DPS) At Dell Technologies, we create the extraordinary. Our Outside Sales Product Specialists are the experts who sell innovation to the world. Responsible for a set of products and services, they get to know their portfolio inside and out. Our Outside Sales teams rely on them for technical advice during the sales...