Responsibilities:
- Analyze Spark/YARN job resource usage and identify inefficiencies such as idle CPU, wasted memory, and small-file problems.
- Build job profiling systems with classification, resource baseline modeling, and historical trend analysis.
- Produce optimization reports and drive business owners to implement improvements.
Requirements:
- Currently enrolled in a Bachelor's or Master's program in Computer Science, Software Engineering, or a related field.
- Proficient in Python, with the ability to write clean, maintainable scripts and tools independently.
- Must be fluent in AI coding tools (Cursor, Copilot, ChatGPT, Claude) for development and troubleshooting.
What We're Looking For:
- Comfortable with Linux: running programs on servers, reading logs, debugging issues.
- Bonus points for experience with Spark/Hive/Flink, AWS basics (S3, EMR, Athena), or shell scripting.
- Bilingual English/Mandarin proficiency is an added advantage for coordinating with overseas partners.
Binance
Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume. The company is trusted by over 300 million people in 100+ countries for its industry-leading security and diverse digital-asset products.