课程描述
Spark and Python for Big Data with PySpark is the name of a training course on Udemy that teaches how to use Spark and Python in metropolitan areas. In this course, you will also learn about PySpark and Spark Streaming, and you will learn machine learning topics well. One of the most valuable technologies in the field of technology is the ability to analyze large data sets, and this course tries to acquaint you with one of the best technologies in this field, namely Apache Spark.
Spark and Python for Big Data with PySpark 是 Udemy 上的一个培训课程的名称,该课程教授如何在大都市地区使用 Spark 和 Python。在本课程中,您还将学习 PySpark 和 Spark Streaming,您将很好地学习机器学习主题。技术领域最有价值的技术之一是分析大型数据集的能力,本课程试图让您熟悉该领域最好的技术之一,即 Apache Spark。
Apache Spark is an open source framework and distributed system that many large companies such as Google, Facebook, Amazon, and NASA use to solve big data problems. This framework can provide up to 100 times better performance than Hadoop MapReduce and fully meet your needs. During this course you will become fully acquainted with Spark and learn how to use Python language in macro data.
Apache Spark 是一个开源框架和分布式系统,谷歌、Facebook、亚马逊和 NASA 等许多大公司都使用它来解决大数据问题。该框架可提供比Hadoop MapReduce 高出100 倍的性能,充分满足您的需求。在本课程中,您将全面熟悉 Spark,并学习如何在宏数据中使用 Python 语言。
Courses taught in this course:; 本课程讲授的课程:
- Use Spark and Python to analyze big data
- 使用Spark和Python分析大数据
- Use Spark 2.0 DataFrame syntax
- 使用 Spark 2.0 DataFrame 语法
- Work on consulting projects
- 从事咨询项目
- Use Spark with Random Forests for classification
- 使用带有随机森林的 Spark 进行分类
- Use a reinforcing gradient tree
- 使用增强梯度树
- Use MLlib to build machine learning models
- 使用 MLlib 构建机器学习模型
Spark and Python specifications for Big Data with PySpark:; PySpark 大数据的 Spark 和 Python 规范:
- English language
- 英语
- Duration: 10 hours and 35 minutes
- 持续时间:10 小时 35 分钟
- Number of lessons: 66
- 课时数:66
- Level of education: Medium
- 教育程度:中等
- Instructor: Jose Portilla
- 教练:何塞波蒂利亚
- File format: mp4
- 文件格式:mp4
Course topics; 课程主题
66 lectures 10:35:05
66 讲 10:35:05
Introduction to Course
4 lectures 30:12
课程介绍 4 讲座 30:12
Setting up Python with Spark
2 lectures 06:11
使用 Spark 设置 Python 2 个讲座 06:11
Local VirtualBox Set-up
3 lectures 31:09
本地 VirtualBox 设置 3 个讲座 31:09
AWS EC2 PySpark Set-up
4 lectures 38:58
AWS EC2 PySpark 设置 4 个讲座 38:58
Setup Databricks
1 lecture 11:41
设置 Databricks 1 个讲座 11:41
AWS EMR Cluster Setup
1 lecture 17:16
AWS EMR 集群设置 1 讲 17:16
Python Crash Course
7 lectures 58:50
Python 速成班 7 讲 58:50
Spark DataFrame Basics
7 lectures 01:04:52
Spark DataFrame 基础知识 7 个讲座 01:04:52
Spark DataFrame Project Exercise
2 lectures 20:06
Spark DataFrame 项目练习 2 讲座 20:06
Introduction to Machine Learning with MLlib
2 lectures 19:25
MLlib 机器学习简介 2 个讲座 19:25
Linear Regression
6 lectures 01:00:03
线性回归 6 个讲座 01:00:03
Logistic Regression
5 lectures 01:00:02
逻辑回归 5 个讲座 01:00:02
Decision Trees and Random Forests
5 lectures 52:26
决策树和随机森林 5 个讲座 52:26
K-means Clustering
5 lectures 41:20
K 均值聚类 5 个讲座 41:20
Collaborative Filtering for Recommender Systems
2 lectures 18:39
推荐系统的协同过滤 2 个讲座 18:39
Natural Language Processing
4 lectures 46:26
自然语言处理 4 讲 46:26
Spark Streaming with Python
5 lectures 57:18
Spark Streaming with Python 5 讲座 57:18
Bonus
1 lecture 00:10
奖励 1 节课 00:10
Course prerequisites; 课程先决条件
- General Programming Skills in any Language (Preferrably Python)
- 任何语言的通用编程技能(最好是 Python)
- 20 GB of free space on your local computer (or alternatively a strong internet connection for AWS)
- 本地计算机上 20 GB 的可用空间(或者 AWS 的强大互联网连接)
Pictures; 图片

Sample movie; 样片
Media error: Format(s) not supported or source(s) not found
媒体错误:格式不受支持或来源未找到
Installation guide; 安装指南
After Extract, view with your favorite Player.
Extract 后,用您最喜欢的播放器观看。
English subtitle
英文字幕
Quality: 720p
画质:720p




