<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>spark on Prime&#39;s blog</title>
    <link>https://www.mayuan.site/tags/spark/</link>
    <description>Recent content in spark on Prime&#39;s blog</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en</language>
    <lastBuildDate>Tue, 24 Dec 2019 10:22:52 +0800</lastBuildDate>
    
	<atom:link href="https://www.mayuan.site/tags/spark/index.xml" rel="self" type="application/rss+xml" />
    
    
    <item>
      <title>Spark Basics</title>
      <link>https://www.mayuan.site/post/bigdata/spark%E5%9F%BA%E7%A1%80/</link>
      <pubDate>Tue, 24 Dec 2019 10:22:52 +0800</pubDate>
      
      <guid>https://www.mayuan.site/post/bigdata/spark%E5%9F%BA%E7%A1%80/</guid>
      <description>A first look at Spark http://datastrophic.io/ Basic concepts Job: A piece of code which reads some input from HDFS or local storage, performs some computation on the data, and writes some output data. Stages: Jobs are divided into stages. Stages are classified as Map or Reduce stages (it&#39;s easier to understand if you have worked on Hadoop and want to correlate). Stages are divided based on</description>
    </item>
    
  </channel>
</rss>