Computer Science Notes

Notes From CS Undergrad Courses FSU

This project is maintained by awa03

Spark

Stack

Spark SQL, Spark Streaming (real-time), GraphX (graph), MLlib (machine learning)

Data Model

Fault Tolerance

RDDs track lineage information (the chain of transformations that produced them), so lost data can be rebuilt by recomputation.

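# Count records per type, then keep only the types seen more than 100 times.
counts = (file.map(lambda rec: (rec.type, 1))
	.reduceByKey(lambda x, y: x + y)
	.filter(lambda kv: kv[1] > 100))  # kv is a (type, count) pair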
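
To make the lineage idea concrete, here is a minimal pure-Python sketch. It does not use Spark at all: `ToyRDD`, `reduce_by_key`, `lose`, and `lineage` are hypothetical names made up for illustration, and the sample records are invented. The only point is that each dataset remembers its parent and the transformation that produced it, so a lost result can be recomputed.

```python
class ToyRDD:
    """Hypothetical stand-in for an RDD: it stores how to compute its data."""

    def __init__(self, compute, parent=None, op="source"):
        self._compute = compute   # function that rebuilds this dataset
        self.parent = parent      # lineage pointer to the parent dataset
        self.op = op              # transformation that produced this dataset
        self._cache = None        # materialized records (can be lost)

    def collect(self):
        # Recompute from lineage whenever the materialized copy is missing.
        if self._cache is None:
            self._cache = list(self._compute())
        return self._cache

    def lose(self):
        # Simulate a failure that drops the materialized records.
        self._cache = None

    def map(self, f):
        return ToyRDD(lambda: (f(x) for x in self.collect()), self, "map")

    def filter(self, pred):
        return ToyRDD(lambda: (x for x in self.collect() if pred(x)), self, "filter")

    def reduce_by_key(self, f):
        def compute():
            acc = {}
            for k, v in self.collect():
                acc[k] = f(acc[k], v) if k in acc else v
            return acc.items()
        return ToyRDD(compute, self, "reduceByKey")

    def lineage(self):
        # Walk the parent pointers back to the source.
        node, ops = self, []
        while node is not None:
            ops.append(node.op)
            node = node.parent
        return ops[::-1]


# Invented sample data: one record type per entry.
records = ["error"] * 150 + ["info"] * 40 + ["warn"] * 120
source = ToyRDD(lambda: records)
counts = (source.map(lambda rec: (rec, 1))
          .reduce_by_key(lambda x, y: x + y)
          .filter(lambda kv: kv[1] > 100))

before = sorted(counts.collect())
counts.lose()                      # the computed result is dropped
after = sorted(counts.collect())   # rebuilt by replaying the lineage
```

Real Spark tracks lineage per partition and recomputes only the lost partitions; this toy recomputes the whole dataset, which is enough to show the mechanism.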

Data Model

Version Control

True Time API

Methods

Instantaneous error bound $\epsilon$
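
A rough sketch of how the methods and the error bound fit together, assuming the interface usually described for TrueTime: `now()` returns an interval `[earliest, latest]` guaranteed to contain the real time, `after(t)` is true only if `t` has definitely passed, and `before(t)` is true only if `t` has definitely not arrived. Here $\epsilon$ is taken to be half the interval width. The `ToyTrueTime` and `TTInterval` names, the injectable clock, and the constant $\epsilon$ are assumptions for illustration, not the real implementation.

```python
import time
from dataclasses import dataclass


@dataclass(frozen=True)
class TTInterval:
    earliest: float
    latest: float


class ToyTrueTime:
    """Hypothetical model of an uncertainty-aware clock."""

    def __init__(self, epsilon, clock=time.time):
        self.epsilon = epsilon   # instantaneous error bound (assumed constant here)
        self._clock = clock      # local clock; injectable so the example is deterministic

    def now(self):
        # Interval of width 2 * epsilon around the local clock reading.
        t = self._clock()
        return TTInterval(t - self.epsilon, t + self.epsilon)

    def after(self, t):
        # True only if t is earlier than every possible current time.
        return self.now().earliest > t

    def before(self, t):
        # True only if t is later than every possible current time.
        return self.now().latest < t


# Fixed fake clock reading 1000 (say, milliseconds) with epsilon = 4.
tt = ToyTrueTime(epsilon=4, clock=lambda: 1000)
interval = tt.now()
```

With this fake clock, `tt.now()` is the interval from 996 to 1004, so `tt.after(990)` is true while `tt.after(998)` is false, because 998 falls inside the uncertainty window.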


Google TPU