Infrastructure Site Reliability Engineer (Entry Level)- USDS
TikTok4 months ago
San Jose, CA, United States
On-site
Full-time
Beginner Level (< 1 year)
Job Description
Position Overview
TikTok’s USDS team is seeking an Infrastructure Site Reliability Engineer (Entry Level) based in San Jose. In this role, you will join the Site Reliability Engineering team that combines software and systems engineering to build and run large‐scale, massively distributed, and fault‐tolerant systems. You’ll work on challenges of scale, design and support resilient systems, and contribute to a culture of diversity, intellectual curiosity, and collaboration.
Key Responsibilities
- Engage in and improve the entire lifecycle of services—from inception and design to development, capacity planning, launch reviews, deployment, operation, and automation.
- Design and implement dashboards and monitoring frameworks for efficient, automated, and intelligent SOA governance.
- Scale systems elastically using automation while driving improvements in reliability, efficiency, and system velocity.
- Practice efficient customer support, incident response, and conduct blameless postmortems.
Required Qualifications
- Bachelor's degree in Computer Science or a related technical field.
- Industrial or internship experience in accredited internet or cloud companies.
- Proficiency in one of the following programming languages: Python, GoLang, Java, or Shell.
- Familiarity with Linux system internals, networking, and distributed systems.
- Strong interpersonal and communication skills.
Preferred Qualifications
- Experience with MySQL, Redis, Kubernetes, Docker, Hadoop, Spark, Flink, HDFS, etc.
- Expertise in designing and analyzing large-scale distributed systems.
Benefits & Perks
- Compensation: Base salary range is $118657 - $187200 annually, with additional discretionary bonuses and restricted stock units based on performance.
- Access to medical, dental, and vision insurance from day one.
- 401(k) savings plan with company match.
- Paid parental leave, short-term and long-term disability coverage, and life insurance.
- Additional wellbeing benefits, including 10 paid holidays, 10 paid sick days, and 17 days of Paid Personal Time (prorated upon hire).
Required Skills
Java
Flink
Spark
Shell
Redis
Kubernetes
Linux system internals
Docker
GoLang
HDFS
Networking
MySQL
Python
Hadoop
Distributed Systems