Feature Engineering Made Easy
上QQ阅读APP看书,第一时间看更新

Conventions used

There are a number of text conventions used throughout this book.

CodeInText: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "Suppose further that given this dataset, our task is to be able to take in three of the attributes (datetimeprotocol, and urgent) and to be able to accurately predict the value of malicious. In layman's terms, we want a system that can map the values of datetimeprotocol, and urgent to the values in malicious."

A block of code is set as follows:

Network_features = pd.DataFrame({'datetime': ['6/2/2018', '6/2/2018', '6/2/2018', '6/3/2018'], 'protocol': ['tcp', 'http', 'http', 'http'], 'urgent': [False, True, True, False]})
Network_response = pd.Series([True, True, False, True])
Network_features
>>
datetime protocol urgent 0 6/2/2018 tcp False 1 6/2/2018 http True 2 6/2/2018 http True 3 6/3/2018 http False
Network_response
>>
0 True 1 True 2 False 3 True dtype: bool

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

times_pregnant                  0.221898
plasma_glucose_concentration    0.466581
diastolic_blood_pressure        0.065068
triceps_thickness               0.074752
serum_insulin                   0.130548
bmi                             0.292695
pedigree_function               0.173844
age                             0.238356
onset_diabetes                  1.000000
Name: onset_diabetes, dtype: float64

Bold: Indicates a new term, an important word, or words that you see onscreen. 

Warnings or important notes appear like this.

Tips and tricks appear like this.