We are a fast-growing SaaS company in the EduTech space, operating mainly in Singapore, Vietnam, and Malaysia. With almost 2000 schools as customers, our mission is to revolutionize early childhood education through technology and provide innovative solutions to our customers. We are committed to transforming the way education is delivered in the Southeast Asian region and beyond.
Collecting data from different sources, processing it, and storing it in a ready-to-use format in the company’s data warehouse using tools such as AWS Glue, Databricks, Airflow
Identifying, designing and implementing data pipeline improvements for greater scalability, optimizing data delivery, and automating manual processes. Parquet in S3 for storage, Airbyte for pipeline management
Improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
Engage proactively with business and product teams to gather and comprehend data requirements, ensuring seamless communication and collaboration throughout the data engineering process.
Develop strategy for long term data platform architecture based on business and engineering needs
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues
Own company data assets (data models), spark, sparkSQL, and hiveSQL jobs to populate data models
Deep expertise in AWS Glue, Databricks, Airflow
Experience with Airbyte, Fivetran, Stitch
Experience with Spark
Experience with SQL, Nosql, Parquet