Data Engineer, Data Platform - USDS

TikTok8 months ago
San Jose, California, United States
Hybrid
Full-time
Junior Level (1-3 years)

Job Description

Position Overview

Join TikTok's US Data Security (USDS) Data Engineering team in the Data Platform group. The team focuses on ensuring the stability, reliability, scalability, and risk management of TikTok's US data processing ecosystem while maintaining strict data compliance standards. As a Data Engineer, you will design innovative big data systems, optimize and maintain both batch and streaming pipelines, and collaborate with cross-functional partners in a hybrid work environment (3 days in-office) to directly impact TikTok’s core products and millions of users.

Key Responsibilities

  • Design, implement and maintain reliable, scalable, robust, and extensible big data systems supporting TikTok’s core products and business.
  • Build and maintain data pipelines utilizing optimal ETL patterns and frameworks, sourcing from both structured and unstructured data sources.
  • Extract data from various sources such as APIs, HIVE tables, and other data repositories to process and store large volumes of data with accuracy and security.
  • Implement and monitor quality control measures to ensure data accuracy, completeness, and consistency.
  • Create and maintain technical documentation (data dictionaries, data flow diagrams, system documentation) to streamline data management and analysis.
  • Optimize pipelines, dashboards, frameworks, and systems to facilitate the efficient development of data artifacts.
  • Collaborate with engineers, product managers, and data scientists to understand data needs and deliver key data insights.
  • Analyze and visualize data to provide impactful, actionable insights for business stakeholders.
  • Establish sound design practices and engineering standards for both technical and non-technical team members.

Required Qualifications

  • Bachelor’s degree in Statistics, Computer Science, or a related technical field, or equivalent practical experience.
  • 1+ years of experience in data analytics and data engineering, including data cleaning, preprocessing, ETL, data analysis, and dashboard development.
  • 1+ years of experience building dashboards using tools such as Tableau, Power BI, or similar visualization software.

Preferred Qualifications

  • Proficiency in distributed data processing using Big Data technologies like Spark/Scala, Python, Hadoop/HDFS/AWS/S3, and Kafka.
  • Strong skills in data modeling, data design, SQL, and various database systems.
  • 3+ years of experience with Big Data technologies (e.g., Hadoop, MapReduce, Hive, Spark, Metastore, Presto, Flume, Kafka, ClickHouse, Flink).
  • Solid background in algorithms and data structures.
  • Experience handling PII and GDPR-related data.
  • Excellent communication skills to effectively interact with both technical and non-technical partners.
  • Ability to deliver high-quality results in a fast-paced environment.
  • A passion for innovation and solving everyday challenges.

Benefits & Perks

  • Compensation: Base salary range of $118,657 - $259,200 annually. Actual compensation may vary based on qualifications, skills, experience, and location, and may include additional discretionary bonuses/incentives and restricted stock units.
  • Access to comprehensive medical, dental, and vision insurance starting on day one.
  • 401(k) savings plan with company match.
  • Paid parental leave, short-term and long-term disability coverage, and life insurance.
  • Wellbeing benefits along with 10 paid holidays, 10 paid sick days, and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

Required Skills

SQL
Python
ETL
Spark
Tableau
Data Engineering
Big Data