【腾讯】数据科学平台研发工程师(深圳/北京)
全职社招TEG技术地点:深圳状态:招聘
工作描述
任职要求
1.硕士及以上学历,计算机科学、数据科学、统计学、应用数学等相关专业,掌握常用的统计学方法,熟悉常用的因果推断方法,熟练使用SQL、Python; 2.具备优秀的代码工程能力,精通java,熟悉spring cloud、k8s、微服务,服务治理; 3.熟悉DATA+AI,拥有工具类产品相关数据分析经验优先,熟悉databricks、dataiku、拥有Spark、Flink等平台的海量数据处理经验、拥有feast特征平台、triton推理框架等系统经验优先; 4.具有优秀的学习能力、沟通能力、团队合作意识;强烈的责任心与主动性,对所负责工作有owner意识,并能自我驱动成长。
工作职责
1.负责数据科学平台dataops+mlops+devops相关工具链(包括Notebook、数据标注、合成、特征、模型、推理、Agent应用等)的设计和开发工作; 2.负责优化系统架构,提升在线特征、推理等服务的性能和稳定性,提升研发质量和效率。
包括英文材料
学历
数据科学
因果推断
SQL+
https://www.youtube.com/watch?v=p3qvj9hO_Bo
In this video we will cover everything you need to know about SQL in only 60 minutes.
Python+
https://www.youtube.com/watch?v=K5KVEU3aaeQ
Master Python from scratch 🚀 No fluff—just clear, practical coding skills to kickstart your journey!
https://www.youtube.com/watch?v=rfscVS0vtbw
This course will give you a full introduction into all of the core concepts in python.
Java+
https://www.youtube.com/watch?v=eIrMbAQSU34
Master Java – a must-have language for software development, Android apps, and more! ☕️ This beginner-friendly course takes you from basics to real coding skills.
Spring Cloud
Kubernetes+
https://kubernetes.io/docs/tutorials/kubernetes-basics/
This tutorial provides a walkthrough of the basics of the Kubernetes cluster orchestration system.
https://www.youtube.com/watch?v=s_o8dwzRlu4
Hands-On Kubernetes Tutorial | Learn Kubernetes in 1 Hour - Kubernetes Course for Beginners
https://www.youtube.com/watch?v=X48VuDVv0do
Full Kubernetes Tutorial | Kubernetes Course | Hands-on course with a lot of demos
微服务+
https://www.youtube.com/watch?v=CqCDOosvZIk
https://www.youtube.com/watch?v=hmkF77F9TLw
Learn about software system design and microservices.
服务治理
数据分析
Spark+
[英文] Learning Spark Book
https://pages.databricks.com/rs/094-YMS-629/images/LearningSpark2.0.pdf
This new edition has been updated to reflect Apache Spark’s evolution through Spark 2.x and Spark 3.0, including its expanded ecosystem of built-in and external data sources, machine learning, and streaming technologies with which Spark is tightly integrated.
Flink+
https://nightlies.apache.org/flink/flink-docs-release-2.0/docs/learn-flink/overview/
This training presents an introduction to Apache Flink that includes just enough to get you started writing scalable streaming ETL, analytics, and event-driven applications, while leaving out a lot of (ultimately important) details.
https://www.youtube.com/watch?v=WajYe9iA2Uk&list=PLa7VYi0yPIH2GTo3vRtX8w9tgNTTyYSux
Today’s businesses are increasingly software-defined, and their business processes are being automated. Whether it’s orders and shipments, or downloads and clicks, business events can always be streamed. Flink can be used to manipulate, process, and react to these streaming events as they occur.