Web Crawling Engineer

3 days ago


Singapore HYPERGAI PTE. LTD. Full time

**About HyperGAI**
- HyperGAI, building Hyper General AI platforms with the world's leading Multimodal Generative AI, empowers everyone to work and live better in a whole new way. We are a small but fast-growing Generative AI deep-tech startup, founded by globally renowned AI thought leaders and a group of exceptional AI experts, talented researchers and experienced engineers. The growth and success of HyperGAI are driven by our amazing employees who believe in the same mission and live by our core values of agility, innovation, trust and one-team. Join us

**Job Details**:
As a Web Crawling Engineer, you will work with other teams to keep product iterations on track, continuously improve the product's user experience through an in-depth understanding of the business and products, and use technology to drive business growth. Your expertise will play a key role in building our products and achieving our vision.

**Responsibilities**:

- **Designing and Implementing Web Crawlers**: Develop scalable and efficient web crawling systems to gather data from various online sources. This involves understanding the structure of different websites, implementing crawling algorithms, handling dynamic content, and ensuring compliance with legal and ethical guidelines.
- **Data Extraction and Parsing**: Write scripts or develop algorithms to extract relevant information from web pages. This may involve parsing HTML/XML documents, using regular expressions, or employing advanced parsing techniques such as natural language processing (NLP) to extract structured data from unstructured sources.
- **Data Quality Assurance**: Implement mechanisms to ensure the quality and reliability of crawled data. This includes error handling, data validation, deduplication, and dealing with inconsistencies or missing data.
- **Scalable Data Storage and Management**: Design and develop backend systems to store, organize, and manage large volumes of crawled data efficiently. This may involve selecting appropriate databases (e.g., relational databases, NoSQL databases), optimizing database schemas, and implementing data caching and indexing strategies for faster retrieval.
- **Performance Optimization**: Optimize the performance of web crawling and data management systems to handle large-scale data processing efficiently. This includes optimizing algorithms, minimizing resource usage, and parallelizing data processing tasks.
- **Monitoring and Maintenance**: Implement monitoring tools and logging mechanisms to track the health and performance of web crawling and data management systems. Proactively identify and resolve issues such as crawling failures, performance bottlenecks, or data inconsistencies.
- **Security and Compliance**: Ensure that the web crawling and data management systems adhere to security best practices and regulatory requirements. Implement mechanisms to protect against security threats such as XSS (Cross-Site Scripting) attacks, CSRF (Cross-Site Request Forgery) attacks, and data breaches.
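
As a rough illustration of the extraction and deduplication tasks listed above, the following minimal sketch pulls links out of an HTML page, resolves them against the page URL, and drops duplicates. It uses only the Python standard library; the names `LinkExtractor` and `extract_links` are hypothetical and chosen for this example, not taken from any HyperGAI system.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Hypothetical helper: collects href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def extract_links(base_url, html):
    """Return absolute, fragment-free, deduplicated links in page order."""
    parser = LinkExtractor()
    parser.feed(html)
    seen = set()
    links = []
    for href in parser.hrefs:
        # Resolve relative links, then strip "#fragment" so that
        # /a and /a#top count as the same page.
        absolute, _fragment = urldefrag(urljoin(base_url, href))
        if absolute not in seen:
            seen.add(absolute)
            links.append(absolute)
    return links
```

A production crawler would typically add more on top of this, such as respecting robots.txt, rate limiting, and rendering dynamic content; this sketch covers only parsing and deduplication.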

**Qualifications**:

- Possess a Bachelor’s degree or above in Computer Science, Software Engineering, Technical, Science, E-Commerce, Information Technology, Mathematics or software-related majors.
- More than 3 years of web crawler experience; experience in large-scale web crawling and multimodal (text, image, video) data is preferred.
- Proficient in one or more programming languages such as Golang/Python/PHP/Java, with strong architectural capabilities and good coding standards.
- Familiar with common databases, such as MySQL, Redis, HBase, etc.
- Experience with web crawling libraries such as Scrapy, Beautiful Soup, Selenium and Apache Nutch.
- Prior experience at search engine companies is a plus.


  • Web Administrator

    1 day ago


    Singapore WINDOW-COOL (S) PTE. LTD. Full time

    **Web Operations & SEO Specialist Responsibilities** - **Website Monitoring & Maintenance**: Ensure the smooth operation of web servers, hardware, and software. Regularly monitor website functionality and performance across all browsers to guarantee optimal user experience. - **Content & User Engagement**: Manage and revise website content. Design product...


  • Singapore Hytech Full time

    We are seeking a Market Intelligence Analyst with expertise in web crawling, data extraction, and competitive intelligence to join our BI team. In this role, you will design and implement Python-based crawlers to collect, analyze, and derive actionable insights from search engines, social media platforms, app stores, and industry news. Your work will...

  • QA Engineer

    6 days ago


    Singapore Ahrefs Full time

    We are looking for a QA Engineer based out of our Singapore headquarters. **You’ll Need**: - 5+ years software testing experience - Strong organizational, written and verbal communication skills - Strong analytical and problem-solving skills with a strong desire to break software - Good documentation skills - Aptitude for learning and enjoys being a team...

  • Tech Lead

    1 day ago


    Singapore TikTok Full time

    The e-commerce industry has seen tremendous growth in recent years and has become a hotly contested space amongst leading Internet companies, and its future growth cannot be underestimated. With millions of loyal users globally, we believe TikTok is an ideal platform to deliver a brand new and better e-commerce experience to our users. Our product...

  • Software Engineer

    6 days ago


    Singapore TIKTOK PTE. LTD. Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and...

  • Software Engineer

    6 days ago


    Singapore TikTok Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and...

  • Software Engineer

    6 days ago


    Singapore ABSOLUT POH TEO INVESTMENT HOLDING LLP Full time

    **Software Engineer (Fullstack)** What to expect: - Write automated functional and unit testing for the code you write - Assist DevOps Engineer on build and release automation - Work closely with fellow developers in pair programming and code review - Work closely with Business Analysts, Quality Engineers, and User Interface/User Experience Designers to...


  • Singapore Sea Limited Full time

    Engineering and Technology - Sea Corporate Lab, Singapore - Entry Level - Collaborate with the design team, other software engineers and product managers to design, build, enhance, and test new features - Write clean and maintainable code that is properly tested and reviewed - Understand the product thoroughly. Analyse, design and develop functionalities...

  • Software Engineer

    2 weeks ago


    Singapore HEPMIL SINGAPORE PTE. LTD. Full time

    **Role-Based Objectives** - Build and maintain enterprise systems. - Innovate and experiment to improve systems for seamless user experiences. - Continual learning and experimentation in product innovation. - Build and maintain enterprise API web services. - Create an integration layer for cross-platform and software products. - Manage data stack. -...

  • Engineering Manager

    1 week ago


    Singapore Canonical Full time

    Canonical is a leading provider of open-source software and operating systems to the global enterprise and technology...