13. Exercise 1#

The following will be used as an in-class exercise.

  1. Write a function Square that takes an integer argument and outputs the square value of this argument. For example, if the input is 3, output should be 9.

  1. Write a comprehensive code to initialize a dictionary where values are squares of keys – keys from 1 to 10.

#Keep this here: Loading Data. 
import pandas as pd
df=pd.read_csv('https://raw.githubusercontent.com/rpi-techfundamentals/spring2019-materials/master/input/train.csv')
df
PassengerId Survived Pclass Name Sex Age SibSp Parch Ticket Fare Cabin Embarked
0 1 0 3 Braund, Mr. Owen Harris male 22.0 1 0 A/5 21171 7.2500 NaN S
1 2 1 1 Cumings, Mrs. John Bradley (Florence Briggs Th... female 38.0 1 0 PC 17599 71.2833 C85 C
2 3 1 3 Heikkinen, Miss. Laina female 26.0 0 0 STON/O2. 3101282 7.9250 NaN S
3 4 1 1 Futrelle, Mrs. Jacques Heath (Lily May Peel) female 35.0 1 0 113803 53.1000 C123 S
4 5 0 3 Allen, Mr. William Henry male 35.0 0 0 373450 8.0500 NaN S
... ... ... ... ... ... ... ... ... ... ... ... ...
886 887 0 2 Montvila, Rev. Juozas male 27.0 0 0 211536 13.0000 NaN S
887 888 1 1 Graham, Miss. Margaret Edith female 19.0 0 0 112053 30.0000 B42 S
888 889 0 3 Johnston, Miss. Catherine Helen "Carrie" female NaN 1 2 W./C. 6607 23.4500 NaN S
889 890 1 1 Behr, Mr. Karl Howell male 26.0 0 0 111369 30.0000 C148 C
890 891 0 3 Dooley, Mr. Patrick male 32.0 0 0 370376 7.7500 NaN Q

891 rows × 12 columns

  1. Find the median of the age column and assign it to the age_median variable.

  1. Count the number of NaN in the Age column.

  1. Replace the NAN with the age_median.

  1. Create a Pivot table which examines the survived by Embarked and Sex columns.

13.1. Challenge Problem#

  1. Create a function which accepts a dataframe (df) and a list of columns (cols), and a function to use (use with potential values mean or median or mode). For each of the columns listed the function should replace NaN with the appropriate value, returning a dataframe.

Add your solution to this notebook:

https://colab.research.google.com/drive/1QDeA-aIjC9o2f638Hmhu_xHBQf3W1CXs?usp=sharing

Put your name.