Web Crawling Engineer

20 hours ago


Singapore HYPERGAI PTE. LTD. Full time

**About HyperGAI**
- HyperGAI, building Hyper General AI platforms with the world's leading Multimodal Generative AI, empowers everyone to work and live better in a whole new way. We are a small but fast-growing Generative AI deep-tech startup, founded by globally renowned AI thought leaders and a group of exceptional AI experts, talented researchers and experienced engineers. The growth and success of HyperGAI are driven by our amazing employees who believe in the same mission and live by our core values of agility, innovation, trust and one-team. Join us

**Job Details**:
As a Web Crawling Engineer, you will work with other teams to keep product iterations running smoothly, continuously improve the product's user experience through an in-depth understanding of the business and products, and use technology to drive business growth. Your expertise will play a key role in building our products and achieving our vision.

**Responsibilities**:

- **Designing and Implementing Web Crawlers**: Develop scalable and efficient web crawling systems to gather data from various online sources. This involves understanding the structure of different websites, implementing crawling algorithms, handling dynamic content, and ensuring compliance with legal and ethical guidelines.
- **Data Extraction and Parsing**: Write scripts or develop algorithms to extract relevant information from web pages. This may involve parsing HTML/XML documents, using regular expressions, or employing advanced parsing techniques such as natural language processing (NLP) to extract structured data from unstructured sources.
- **Data Quality Assurance**: Implement mechanisms to ensure the quality and reliability of crawled data. This includes error handling, data validation, deduplication, and dealing with inconsistencies or missing data.
- **Scalable Data Storage and Management**: Design and develop backend systems to store, organize, and manage large volumes of crawled data efficiently. This may involve selecting appropriate databases (e.g., relational databases, NoSQL databases), optimizing database schemas, and implementing data caching and indexing strategies for faster retrieval.
- **Performance Optimization**: Optimize the performance of web crawling and data management systems to handle large-scale data processing efficiently. This includes optimizing algorithms, minimizing resource usage, and parallelizing data processing tasks.
- **Monitoring and Maintenance**: Implement monitoring tools and logging mechanisms to track the health and performance of web crawling and data management systems. Proactively identify and resolve issues such as crawling failures, performance bottlenecks, or data inconsistencies.
- **Security and Compliance**: Ensure that the web crawling and data management systems adhere to security best practices and regulatory requirements. Implement mechanisms to protect against security threats such as XSS (Cross-Site Scripting) attacks, CSRF (Cross-Site Request Forgery) attacks, and data breaches.

**Qualifications**:

- Bachelor’s degree or above in Computer Science, Software Engineering, Technical, Science, E-Commerce, Information Technology, Mathematics or a software-related major.
- More than 3 years of web crawler experience; experience with large-scale web crawling and multimodal (text, image, video) data is preferred.
- Proficient in one or more programming languages such as Golang/Python/PHP/Java, with strong architectural capabilities and good coding standards.
- Familiar with common databases, such as MySQL, Redis, HBase, etc.
- Experience with web crawling libraries such as Scrapy, Beautiful Soup, Selenium, and Apache Nutch.
- Prior experience at search engine companies is a plus.


  • QA Engineer

    4 days ago


    Singapore Ahrefs Full time

    We are looking for a QA Engineer based out of our Singapore headquarters. **You’ll Need**: - 5+ years software testing experience - Strong organizational, written and verbal communication skills - Strong analytical and problem-solving skills with a strong desire to break software - Good documentation skills - Aptitude for learning and enjoys being a team...

  • Software Engineer

    1 week ago


    Singapore TikTok Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and...

  • Software Engineer

    4 days ago


    Singapore TikTok Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and...

  • Software Engineer

    4 days ago


    Singapore TIKTOK PTE. LTD. Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and...

  • Software Engineer

    4 days ago


    Singapore ABSOLUT POH TEO INVESTMENT HOLDING LLP Full time

    **Software Engineer (Fullstack)** What to expect: - Write automated functional and unit testing for the code you write - Assist DevOps Engineer on build and release automation - Work closely with fellow developers in pair programming and code review - Work closely with Business Analysts, Quality Engineers, and User Interface/User Experience Designers to...


  • Singapore Hytech Full time

    Overview We are seeking a Market Intelligence Analyst with expertise in web crawling, data extraction, and competitive intelligence to join our BI team. You will design and implement Python-based crawlers to collect, analyze, and derive actionable insights from search engines, social media platforms, app stores, and industry news. Your work will support...


  • Singapore Canonical Full time

    Engineering Manager - Web role at Canonical, posted 3 days ago. Canonical is a leading provider of open-source software and operating systems to the global enterprise and technology...

  • Software Engineer

    1 week ago


    Singapore TIKTOK PTE. LTD. Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for...

  • Software Engineer

    1 week ago


    Singapore TikTok Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you,...

  • Web Engineer

    1 week ago


    Singapore Human Made, Makers Of Altis Dxp Full time

    **Requirements**: **Responsibilities**: - All responsibilities described under web engineer, plus: - Take ownership at a project level for not only your own work, but work that interacts or impacts your own - Assisting other developers on your team when asked for assistance - Architect projects and complex subsystems - Assess the pros and cons of different...