Data Engineer

SEMIANALYSIS PRIVATE LIMITED • THE ARCADE, 11 COLLYER QUAY, 049317, Singapore

Full-time • Mid Level

$11k - $14k per year

Posted 6/17/2026 • Expires 7/17/2026

Job Description

Position Overview

We are seeking a highly capable and commercially minded Data Engineer to join our Singapore team. This role will own the design, development, and reliability of the data models, pipelines, dashboards, and APIs that power our industry research, consulting work, and client-facing analytics products.

You will work closely with lead analysts, researchers, engineering stakeholders, and commercial teams to transform complex, fragmented data sources into accurate, scalable, and decision-useful data products. The ideal candidate is not just a pipeline builder, but someone who can architect robust systems, understand business context, challenge data assumptions, and independently drive solutions from problem definition through production deployment.

This is a high-autonomy role suited for someone who can operate with minimal oversight, make sound technical decisions, and build infrastructure that supports both internal research velocity and external client delivery. If you have a favorite SCD type (ours is Type 2), we should probably talk.

Responsibilities

Own the architecture, development, and maintenance of core data models that support SemiAnalysis’s research, consulting, and client-facing analytics products.

Design, build, and optimize scalable ETL/ELT pipelines across multiple structured and unstructured data sources.

Partner with lead analysts to ensure data accuracy, completeness, consistency, and commercial utility across research workflows.

Translate ambiguous business and research requirements into reliable data models, dashboards, APIs, and analytical tools.

Maintain and extend internal and external-facing dashboards, APIs, and data delivery systems.

Establish strong data quality, validation, lineage, observability, and monitoring practices across key datasets and pipelines.

Improve the performance, reliability, and modularity of existing data infrastructure.

Support the integration of new datasets, tools, vendors, and infrastructure components