1-Introduction

Strong Data Science Libraries : The SciPy Ecosystem

- pandas : Python Data Analysis Library

- matplotlib : 2D Plotting Library

Provides excellent basis to move into Machine Learning, Text Mining and Network Analysis

Jupyter Notebook

David Donoho 50 Years of Data Science

- Data Exploration and Preparation

- Data Representation and Transformation

- Computing with Data

- Data Modelling

- Data Visualization and Presentation

- Science about Data Science

Python Knowledge

- Functions

- Lists, Dictionaries, Tuples

- Strings

- Files

- Datetime

- Functional Approach - Map, Lambda

- Numpy

Pandas Toolkit

This toolkit brings functionality in R to Python world

Provides the data structure for thinking about data in tabular form.

Advanced Querying and manipulation with Pandas

Basic Statistical Analysis with numpy and scipy