Index
Symbols
- %automagic, Shell-Related Magic Commands
- %cpaste, Pasting Code Blocks: %paste and %cpaste
- %debug, Debugging: When Reading Tracebacks Is Not Enough
- %history, Related Magic Commands
- %lprun, Line-by-Line Profiling with %lprun
- %lsmagic, Help on Magic Functions: ?, %magic, and %lsmagic
- %magic, Help on Magic Functions: ?, %magic, and %lsmagic
- %matplotlib, Plotting from an IPython shell
- %memit, Profiling Memory Use: %memit and %mprun
- %mode, Controlling Exceptions: %xmode-Controlling Exceptions: %xmode
- %mprun, Profiling Memory Use: %memit and %mprun
- %paste, Pasting Code Blocks: %paste and %cpaste
- %prun, Profiling Full Scripts: %prun
- %run, Running External Code: %run
- %time, Timing Code Snippets: %timeit and %time-Timing Code Snippets: %timeit and %time
- %timeit, Timing Code Execution: %timeit, Timing Code Snippets: %timeit and %time-Timing Code Snippets: %timeit and %time
- & (ampersand), Boolean Arrays as Masks
- * (asterisk), Beyond tab completion: Wildcard matching
- : (colon), Array Slicing: Accessing Subarrays
- ? (question mark), Accessing Documentation with ?
- ?? (double question mark), Accessing Source Code with ??
- _ (underscore) shortcut, Underscore Shortcuts and Previous Outputs
- | (operator), Boolean Arrays as Masks
A
- absolute value function, Absolute value
- aggregate() method, Aggregation
- aggregates
- aggregation (NumPy), Aggregations: Min, Max, and Everything in Between-Example: What Is the Average Height of US Presidents?
- aggregation (Pandas), Aggregation and Grouping-Grouping example
- Akaike information criterion (AIC), How many components?, Example: GMM for Generating New Data
- Albers equal-area projection, Conic projections
- algorithmic efficiency
- ampersand (&), Boolean Arrays as Masks
- Anaconda, Installation Considerations
- and keyword, Boolean Arrays as Masks
- annotation of plots, Text and Annotation-Arrows and Annotation
- APIs (see Estimator API)
- append() method, Pandas vs. Python, The append() method
- apply() method, The apply() method
- arithmetic operators, Array arithmetic
- arrays
- accessing single rows/columns, Accessing array rows and columns
- arithmetic operators, Array arithmetic
- attributes, NumPy Array Attributes
- basics, The Basics of NumPy Arrays
- Boolean, Working with Boolean Arrays-Boolean operators
- broadcasting, Computation on Arrays: Broadcasting-Plotting a two-dimensional function
- centering, Centering an array
- computation on, Computation on NumPy Arrays: Universal Functions-Ufuncs: Learning More
- concatenation, Concatenation of arrays, Recall: Concatenation of NumPy Arrays
- creating copies, Creating copies of arrays
- creating from Python lists, Creating Arrays from Python Lists
- creating from scratch, Creating Arrays from Scratch
- data as, Introduction to NumPy
- DataFrame object as, DataFrame as a generalized NumPy array
- DataFrame object constructed from, From a two-dimensional NumPy array
- fixed-type, Fixed-Type Arrays in Python
- Index object as immutable array, Index as immutable array
- Index object vs., Index as ordered set
- indexing: accessing single elements, Array Indexing: Accessing Single Elements
- reshaping, Reshaping of Arrays
- Series object vs., Series as generalized NumPy array
- slicing, Array Slicing: Accessing Subarrays
- slicing multidimensional subarrays, Multidimensional subarrays
- slicing one-dimensional subarrays, Array Slicing: Accessing Subarrays
- sorting, Sorting Arrays-RecordArrays: Structured Arrays with a Twist
- specifying output to, Specifying output
- splitting, Splitting of arrays
- standard data types, NumPy Standard Data Types
- structured, Structured Data: NumPy’s Structured Arrays-RecordArrays: Structured Arrays with a Twist
- subarrays as no-copy views, Subarrays as no-copy views
- summing values in, Summing the Values in an Array
- universal functions, Computation on NumPy Arrays: Universal Functions-Ufuncs: Learning More
- arrows, Arrows and Annotation-Arrows and Annotation
- asfreq() method, Resampling and converting frequencies-Resampling and converting frequencies
- asterisk (*), Beyond tab completion: Wildcard matching
- automagic function, Shell-Related Magic Commands
- axes limits, Adjusting the Plot: Axes Limits-Adjusting the Plot: Axes Limits
B
- bagging, Ensembles of Estimators: Random Forests
- bandwidth (see kernel bandwidth)
- bar (|) operator, Boolean Arrays as Masks
- bar plots, Bar plots
- Basemap toolkit
- basis function regression, Derived Features, Basis Function Regression-Gaussian basis functions
- Bayesian classification, Bayesian Classification, Example: Not-So-Naive Bayes-Using our custom estimator
- (see also naive Bayes classification)
- Bayesian information criterion (BIC), How many components?
- Bayesian Methods for Hackers stylesheet, Bayesian Methods for Hackers style
- Bayess theorem, Bayesian Classification
- bias–variance trade-off
- bicycle traffic prediction
- big-O notation, Example: k-Nearest Neighbors
- binary ufuncs, Exploring NumPy’s UFuncs
- binnings, plt.hexbin: Hexagonal binnings
- bitwise logic operators, Boolean operators
- bogosort, Sorting Arrays
- Bokeh, Other Python Graphics Libraries
- Boolean arrays
- Boolean masks, Comparisons, Masks, and Boolean Logic-Boolean Arrays as Masks
- Boolean operators, Boolean operators
- broadcasting, Computation on Arrays: Broadcasting-Plotting a two-dimensional function
- adding two-dimensional array to one-dimensional array, Broadcasting example 1
- basics, Introducing Broadcasting-Introducing Broadcasting
- centering an array, Centering an array
- defined, Outer products, Computation on Arrays: Broadcasting
- in practice, Broadcasting in Practice
- plotting two-dimensional function, Plotting a two-dimensional function
- rules, Rules of Broadcasting-Broadcasting example 3
- two compatible arrays, Broadcasting example 2
- two incompatible arrays, Broadcasting example 3
C
- categorical data, Categorical Features
- class labels (for data point), Classification: Predicting discrete labels
- classification task
- clustering, Categories of Machine Learning
- code
- coefficient of determination, The bias–variance trade-off
- colon (:), Array Slicing: Accessing Subarrays
- color compression, Example 2: k-means for color compression-Example 2: k-means for color compression
- colorbars
- colormap, Customizing Colorbars-Choosing the colormap
- column(s)
- column-wise operations, DataFrame.eval() for Column-Wise Operations-Local variables in DataFrame.eval()
- command history shortcuts, Command History Shortcuts
- comparison operators, Comparison Operators as ufuncs-Comparison Operators as ufuncs
- concatenation
- confusion matrix, Classification on digits
- conic projections, Conic projections
- contour plots, Density and Contour Plots-Visualizing a Three-Dimensional Function
- Conway, Drew, What Is Data Science?
- cross-validation, Model validation via cross-validation-Validation curves in Scikit-Learn
- cubehelix colormap, Choosing the colormap
- cylindrical projections, Cylindrical projections
D
- data
- data representation (Scikit-Learn package), Data Representation in Scikit-Learn-Target array
- data science, defining, What Is Data Science?
- data types, Understanding Data Types in Python
- DataFrame object (Pandas), The Pandas DataFrame Object-From a NumPy structured array
- as dictionary, DataFrame as a dictionary-DataFrame as a dictionary
- as generalized NumPy array, DataFrame as a generalized NumPy array
- as specialized dictionary, DataFrame as specialized dictionary
- as two-dimensional array, DataFrame as two-dimensional array-DataFrame as two-dimensional array
- constructing, Constructing DataFrame objects
- data selection in, Data Selection in DataFrame
- defined, Data Manipulation with Pandas
- index alignment in, Index alignment in DataFrame
- masking, Additional indexing conventions
- multiply indexed, Multiply indexed DataFrames
- operations between Series object and, Ufuncs: Operations Between DataFrame and Series
- slicing, Additional indexing conventions
- DataFrame.eval() method, DataFrame.eval() for Column-Wise Operations-Local variables in DataFrame.eval()
- DataFrame.query() method, DataFrame.query() Method
- datasets
- datetime module, Native Python dates and times: datetime and dateutil
- datetime64 dtype, Typed arrays of times: NumPy’s datetime64
- dateutil module, Native Python dates and times: datetime and dateutil
- debugging, Debugging: When Reading Tracebacks Is Not Enough-Partial list of debugging commands
- decision trees, Motivating Random Forests: Decision Trees-Decision trees and overfitting
- deep learning, Caveats and Improvements
- density estimator
- describe() method, Dispatch methods
- development, IPython
- dictionary(-ies)
- digits, recognition of (see optical character recognition)
- dimensionality reduction, Example: Handwritten Digits
- discriminative classification, Motivating Support Vector Machines-Motivating Support Vector Machines
- documentation, accessing
- double question mark (??), Accessing Source Code with ??
- dropna() method, Dropping null values
- dynamic typing, Understanding Data Types in Python
E
- eigenfaces, Example: Eigenfaces-Example: Eigenfaces
- ensemble estimator/method, In-Depth: Decision Trees and Random Forests
- (see also random forests)
- ensemble learner, Motivating Random Forests: Decision Trees
- equidistant cylindrical projection, Cylindrical projections
- errors, visualizing
- Estimator API, Scikit-Learn’s Estimator API-Summary
- eval() function, pandas.eval() for Efficient Operations-Other operations
- exceptions, controlling, Controlling Exceptions: %xmode-Controlling Exceptions: %xmode
- expectation-maximization (E-M) algorithm
- exponentials, Exponents and logarithms
- external code, magic commands for running, Running External Code: %run
F
- face recognition
- faceted histograms, Faceted histograms
- factor plots, Factor plots
- fancy indexing, Fancy Indexing-Example: Binning Data
- feature engineering, Feature Engineering-Feature Pipelines
- feature, data point, Classification: Predicting discrete labels
- features matrix, Features matrix
- fillna() method, Filling null values
- filter() method, Filtering
- FiveThirtyEight stylesheet, FiveThirtyEight style
- fixed-type arrays, Fixed-Type Arrays in Python
G
- Gaussian basis functions, Gaussian basis functions-Gaussian basis functions
- Gaussian mixture models (GMMs), In Depth: Gaussian Mixture Models-Example: GMM for Generating New Data
- Gaussian naive Bayes classification, Supervised learning example: Iris classification, Classification on digits, Gaussian Naive Bayes-Gaussian Naive Bayes, HOG in Action: A Simple Face Detector
- Gaussian process regression (GPR), Continuous Errors
- generative models, Bayesian Classification
- geographic data, Geographic Data with Basemap
- get() operation, Vectorized item access and slicing
- get_dummies() method, Indicator variables
- ggplot stylesheet, ggplot
- graphics libraries, Other Python Graphics Libraries
- GroupBy aggregation, Pivot Tables
- GroupBy object, The GroupBy object-Dispatch methods
- groupby() operation (Pandas), GroupBy: Split, Apply, Combine-Grouping example
H
- handwritten digits, recognition of (see optical character recognition)
- hard negative mining, Caveats and Improvements
- help
- help() function, Accessing Documentation with ?
- hexagonal binnings, plt.hexbin: Hexagonal binnings
- hierarchical indexing
- Histogram of Oriented Gradients (HOG)
- histograms, Histograms, Binnings, and Density-Kernel density estimation
- binning data to create, Example: Binning Data
- faceted, Faceted histograms
- KDE and, Kernel density estimation, Motivating KDE: Histograms-Motivating KDE: Histograms
- manual customization, Plot Customization by Hand-Plot Customization by Hand
- plt.hexbin() function, plt.hexbin: Hexagonal binnings
- plt.hist2d() function, plt.hist2d: Two-dimensional histogram
- Seaborn, Histograms, KDE, and densities-Histograms, KDE, and densities
- simple, Histograms, Binnings, and Density-Histograms, Binnings, and Density
- two-dimensional, Two-Dimensional Histograms and Binnings-Kernel density estimation
- holdout sets, Model validation the right way: Holdout sets
- Hunter, John, Visualization with Matplotlib
- hyperparameters, Supervised learning example: Simple linear regression
- (see also model validation)
I
- iloc attribute (Pandas), Indexers: loc, iloc, and ix
- images, encoding for machine learning analysis, Image Features
- immutable array, Index object as, Index as immutable array
- importing, tab completion for, Tab completion when importing
- In objects, IPython, IPython’s In and Out Objects
- index alignment
- Index object (Pandas), The Pandas Index Object-Index as ordered set
- indexing
- IndexSlice object, Multiply indexed DataFrames
- indicator variables, Indicator variables
- inner join, Specifying Set Arithmetic for Joins
- input/output history, IPython, Input and Output History-Related Magic Commands
- installation, Python, Installation Considerations
- integers, Python, A Python Integer Is More Than Just an Integer
- IPython, IPython: Beyond Normal Python
- accessing documentation with ?, Accessing Documentation with ?
- accessing source code with ??, Accessing Source Code with ??
- command-line commands in shell, Shell Commands in IPython
- controlling exceptions, Controlling Exceptions: %xmode-Controlling Exceptions: %xmode
- debugging, Debugging: When Reading Tracebacks Is Not Enough-Partial list of debugging commands
- documentation, Help and Documentation in IPython-Beyond tab completion: Wildcard matching, Introduction to NumPy
- errors handling, Errors and Debugging-Partial list of debugging commands
- exploring modules with tab completion, Exploring Modules with Tab Completion-Tab completion when importing
- help and documentation, Help and Documentation in IPython-Beyond tab completion: Wildcard matching
- input/output history, Input and Output History-Related Magic Commands
- keyboard shortcuts in shell, Keyboard Shortcuts in the IPython Shell
- launching Jupyter notebook, Launching the Jupyter Notebook
- launching shell, Launching the IPython Shell
- magic commands, IPython Magic Commands-Help on Magic Functions: ?, %magic, and %lsmagic
- notebook (see Jupyter notebook)
- plotting from shell, Plotting from an IPython shell
- profiling and timing code, Profiling and Timing Code-Profiling Memory Use: %memit and %mprun
- shell commands, IPython and Shell Commands-Passing Values to and from the Shell
- shell-related magic commands, Shell-Related Magic Commands
- web resources, Web Resources
- wildcard matching, Beyond tab completion: Wildcard matching
- Iris dataset
- isnull() method, Detecting null values
- Isomap
- ix attribute (Pandas), Indexers: loc, iloc, and ix
K
- k-means clustering, Clustering: Inferring labels on unlabeled data, In Depth: k-Means Clustering-Example 2: k-means for color compression
- kernel (defined), Kernel Density Estimation in Practice
- kernel bandwidth
- kernel density estimation (KDE), In-Depth: Kernel Density Estimation-Using our custom estimator
- bandwidth selection via cross-validation, Selecting the bandwidth via cross-validation
- Bayesian generative classification with, Example: Not-So-Naive Bayes-Using our custom estimator
- custom estimator, Example: Not-So-Naive Bayes-Using our custom estimator
- histograms and, Motivating KDE: Histograms-Motivating KDE: Histograms
- in practice, Kernel Density Estimation in Practice-Using our custom estimator
- Matplotlib, Kernel density estimation
- Seaborn, Histograms, KDE, and densities
- visualization of geographic distributions, Example: KDE on a Sphere-Example: KDE on a Sphere
- kernel SVM, Beyond linear boundaries: Kernel SVM-Beyond linear boundaries: Kernel SVM
- kernel transformation, Beyond linear boundaries: Kernel SVM
- kernel trick, Beyond linear boundaries: Kernel SVM
- keyboard shortcuts, IPython shell, Keyboard Shortcuts in the IPython Shell
- Knuth, Donald, Profiling and Timing Code
L
- labels/labeling
- Lambert conformal conic projection, Conic projections
- lasso regularization (L1 regularization), Lasso regularization ()
- learning curves, computing, Learning curves in Scikit-Learn
- left join, Specifying Set Arithmetic for Joins
- left_index keyword, The left_index and right_index keywords-The left_index and right_index keywords
- legends, plot
- levels, naming, MultiIndex level names
- line plots
- line-by-line profiling, Line-by-Line Profiling with %lprun
- linear regression (in machine learning), In Depth: Linear Regression
- lists, Python, A Python List Is More Than Just a List-NumPy Standard Data Types
- loc attribute (Pandas), Indexers: loc, iloc, and ix
- locally linear embedding (LLE), Nonlinear Manifolds: Locally Linear Embedding-Nonlinear Manifolds: Locally Linear Embedding
- logarithms, Exponents and logarithms
M
- machine learning, Machine Learning
- basics, Machine Learning-Summary
- categories of, Categories of Machine Learning
- classification task, Classification: Predicting discrete labels-Classification: Predicting discrete labels
- clustering, Clustering: Inferring labels on unlabeled data-Clustering: Inferring labels on unlabeled data
- decision trees and random forests, In-Depth: Decision Trees and Random Forests
- defined, What Is Machine Learning?
- dimensionality reduction, Dimensionality reduction: Inferring structure of unlabeled data-Dimensionality reduction: Inferring structure of unlabeled data
- educational resources, Further Machine Learning Resources
- face detection pipeline, Application: A Face Detection Pipeline-Caveats and Improvements
- feature engineering, Feature Engineering-Feature Pipelines
- GMM (see Gaussian mixture models)
- hyperparameters and model validation, Hyperparameters and Model Validation-Summary
- KDE (see kernel density estimation)
- linear regression (see linear regression)
- manifold learning (see manifold learning)
- naive Bayes classification, In Depth: Naive Bayes Classification-When to Use Naive Bayes
- PCA (see principal component analysis)
- qualitative examples, Qualitative Examples of Machine Learning Applications-Dimensionality reduction: Inferring structure of unlabeled data
- regression task, Regression: Predicting continuous labels-Regression: Predicting continuous labels
- Scikit-Learn basics, Introducing Scikit-Learn
- supervised, Categories of Machine Learning
- SVMs (see support vector machines)
- unsupervised, Categories of Machine Learning
- magic commands
- manifold learning, In-Depth: Manifold Learning-Example: Visualizing Structure in Digits
- many-to-one joins, Many-to-one joins
- map projections, Map Projections-Other projections
- maps, geographic (see geographic data)
- margins, maximizing, Support Vector Machines: Maximizing the Margin-Tuning the SVM: Softening margins
- masking, Additional indexing conventions
- MATLAB-style interface, MATLAB-style interface
- Matplotlib, Visualization with Matplotlib, Matplotlib Resources
- axes limits for line plots, Adjusting the Plot: Axes Limits-Adjusting the Plot: Axes Limits
- changing defaults via rcParams, Changing the Defaults: rcParams
- colorbar customization, Customizing Colorbars-Example: Handwritten Digits
- configurations and stylesheets, Customizing Matplotlib: Configurations and Stylesheets-Seaborn style
- density and contour plots, Density and Contour Plots-Visualizing a Three-Dimensional Function
- error visualization, Visualizing Errors-Continuous Errors
- general tips, General Matplotlib Tips-Saving Figures to File
- geographic data with Basemap toolkit, Geographic Data with Basemap
- gotchas, Labeling Plots
- histograms, binnings, and density, Histograms, Binnings, and Density-Kernel density estimation
- importing, Importing matplotlib
- interfaces, Two Interfaces for the Price of One
- labeling simple line plots, Labeling Plots-Labeling Plots
- line colors and styles, Adjusting the Plot: Line Colors and Styles-Adjusting the Plot: Line Colors and Styles
- MATLAB-style interfaces, MATLAB-style interface
- multiple subplots, Multiple Subplots-plt.GridSpec: More Complicated Arrangements
- object hierarchy of plots, Customizing Ticks
- object-oriented interfaces, Object-oriented interface
- plot customization, Plot Customization by Hand-Plot Customization by Hand
- plot display contexts, show() or No show()? How to Display Your Plots-Plotting from an IPython notebook
- plot legend customization, Customizing Plot Legends-Multiple Legends
- plotting from a script, Plotting from a script
- plotting from IPython notebook, Plotting from an IPython notebook
- plotting from IPython shell, Plotting from an IPython shell
- resources and documentation for, Matplotlib Resources
- saving figures to file, Saving Figures to File
- Seaborn vs., Visualization with Seaborn-Seaborn Versus Matplotlib
- setting styles, Setting Styles
- simple line plots, Simple Line Plots-Labeling Plots
- stylesheets, Stylesheets-Seaborn style
- text and annotation, Text and Annotation-Arrows and Annotation
- three-dimensional function visualization, Visualizing a Three-Dimensional Function-Visualizing a Three-Dimensional Function
- three-dimensional plotting, Three-Dimensional Plotting in Matplotlib-Example: Visualizing a Möbius strip
- tick customization, Customizing Ticks-Summary of Formatters and Locators
- max() function, Minimum and Maximum
- maximum margin estimator, Support Vector Machines: Maximizing the Margin
- (see also support vector machines (SVMs))
- memory use, profiling, Profiling Memory Use: %memit and %mprun
- merge key
- merging, Combining Datasets: Merge and Join-Example: US States Data
- min() function, Minimum and Maximum
- Miniconda, Installation Considerations
- missing data, Missing Data in Pandas-NaN and None in Pandas
- Möbius strip, Example: Visualizing a Möbius strip-Example: Visualizing a Möbius strip
- model (defined), Classification: Predicting discrete labels
- model parameters (defined), Classification: Predicting discrete labels
- model selection
- model validation, Hyperparameters and Model Validation-Summary
- modules, IPython, Exploring Modules with Tab Completion-Tab completion when importing
- Mollweide projection, Pseudo-cylindrical projections
- multi-indexing (see hierarchical indexing)
- multidimensional scaling (MDS), MDS as Manifold Learning-MDS as Manifold Learning
- MultiIndex type, The better way: Pandas MultiIndex-MultiIndex as extra dimension
- creation methods, Methods of MultiIndex Creation-MultiIndex for columns
- data aggregations on, Data Aggregations on Multi-Indices
- explicit constructors for, Explicit MultiIndex constructors
- extra dimension of data with, MultiIndex as extra dimension
- for columns, MultiIndex for columns
- index setting/resetting, Index setting and resetting
- indexing and slicing, Indexing and Slicing a MultiIndex-Multiply indexed DataFrames
- keys option, Adding MultiIndex keys
- level names, MultiIndex level names
- multiply indexed DataFrames, Multiply indexed DataFrames
- multiply indexed Series, Multiply indexed Series
- rearranging, Rearranging Multi-Indices-Index setting and resetting
- sorted/unsorted indices with, Sorted and unsorted indices
- stacking/unstacking indices, Stacking and unstacking indices
- multinomial naive Bayes classification, Multinomial Naive Bayes-Example: Classifying text
N
- naive Bayes classification, In Depth: Naive Bayes Classification-When to Use Naive Bayes
- NaN value, From a list of dicts, Index alignment in Series, NaN: Missing numerical data
- navigation shortcuts, Navigation Shortcuts
- neural networks, Caveats and Improvements
- noise filter, PCA as, PCA as Noise Filtering-PCA as Noise Filtering
- None object, None: Pythonic missing data, NaN and None in Pandas
- nonlinear embeddings, MDS and, Nonlinear Embeddings: Where MDS Fails
- notnull() method, Detecting null values
- np.argsort() function, Fast Sorting in NumPy: np.sort and np.argsort
- np.concatenate() function, Concatenation of arrays, Duplicate indices
- np.sort() function, Fast Sorting in NumPy: np.sort and np.argsort
- null values, Operating on Null Values-Filling null values
- NumPy, Introduction to NumPy
- aggregations, Aggregations: Min, Max, and Everything in Between-Example: What Is the Average Height of US Presidents?
- array attributes, NumPy Array Attributes
- array basics, The Basics of NumPy Arrays
- array indexing: accessing single elements, Array Indexing: Accessing Single Elements
- array slicing: accessing subarrays, Array Slicing: Accessing Subarrays
- Boolean masks, Comparisons, Masks, and Boolean Logic-Boolean Arrays as Masks
- broadcasting, Computation on Arrays: Broadcasting-Plotting a two-dimensional function
- comparison operators as ufuncs, Comparison Operators as ufuncs-Comparison Operators as ufuncs
- computation on arrays, Computation on NumPy Arrays: Universal Functions-Ufuncs: Learning More
- data types in Python, Understanding Data Types in Python
- datetime64 dtype, Typed arrays of times: NumPy’s datetime64
- documentation, Introduction to NumPy
- fancy indexing, Fancy Indexing-Example: Binning Data
- keywords and/or vs. operators &/|, Boolean Arrays as Masks
- sorting arrays, Sorting Arrays-Example: k-Nearest Neighbors
- standard data types, NumPy Standard Data Types
- structured arrays, Structured Data: NumPy’s Structured Arrays-RecordArrays: Structured Arrays with a Twist
- universal functions, Computation on NumPy Arrays: Universal Functions-Ufuncs: Learning More
O
- object-oriented interface, Object-oriented interface
- offsets, time series, Frequencies and Offsets
- on keyword, The on keyword
- one-hot encoding, Categorical Features
- one-to-one joins, One-to-one joins
- optical character recognition
- digit classification, Classification on digits-Classification on digits
- GMMs, Example: GMM for Generating New Data-Example: GMM for Generating New Data
- k-means clustering, Example 1: k-Means on digits-Example 1: k-Means on digits
- loading/visualizing digits data, Loading and visualizing the digits data
- Matplotlib, Example: Handwritten Digits-Example: Handwritten Digits
- PCA as noise filtering, PCA as Noise Filtering-PCA as Noise Filtering
- PCA for visualization, PCA for visualization: Handwritten digits
- random forests for classifying digits, Example: Random Forest for Classifying Digits-Example: Random Forest for Classifying Digits
- Scikit-Learn application, Application: Exploring Handwritten Digits-Classification on digits
- visualizing structure in digits, Example: Visualizing Structure in Digits-Example: Visualizing Structure in Digits
- or keyword, Boolean Arrays as Masks
- ordered set, Index object as, Index as ordered set
- orthographic projection, Perspective projections
- Out objects, IPython, IPython’s In and Out Objects
- outer join, Specifying Set Arithmetic for Joins
- outer products, Outer products
- outliers, PCA and, Principal Component Analysis Summary
- output, suppressing, Suppressing Output
- overfitting, Learning Curves, Decision trees and overfitting
P
- pair plots, Pair plots
- Pandas, Data Manipulation with Pandas
- aggregation and grouping, Aggregation and Grouping-Grouping example
- and compound expressions, Motivating query() and eval(): Compound Expressions
- appending datasets, The append() method
- built-in documentation, Installing and Using Pandas
- combining datasets, Combining Datasets: Concat and Append-Example: US States Data
- concatenation of datasets, Combining Datasets: Concat and Append-Concatenation with joins
- data indexing and selection, Data Indexing and Selection
- data selection in DataFrame, Data Selection in DataFrame-Further Resources
- data selection in Series, Data Selection in Series-Indexers: loc, iloc, and ix
- DataFrame object, The Pandas DataFrame Object-From a NumPy structured array
- eval() and query(), High-Performance Pandas: eval() and query()-High-Performance Pandas: eval() and query()
- handling missing data, Handling Missing Data-Trade-Offs in Missing Data Conventions
- hierarchical indexing, Hierarchical Indexing-Data Aggregations on Multi-Indices
- Index object, The Pandas Index Object-Index as ordered set
- installation, Installing and Using Pandas
- merging/joining datasets, Combining Datasets: Merge and Join-Example: US States Data
- NaN and None in, NaN and None in Pandas
- null values, Operating on Null Values-Filling null values
- objects, Introducing Pandas Objects-Index as ordered set
- operating on data in, Operating on Data in Pandas-Filling null values
- (see also universal functions)
- pandas.eval(), pandas.eval() for Efficient Operations-Other operations
- Panel data, Data Aggregations on Multi-Indices
- pivot tables, Pivot Tables-Further data exploration
- Series object, The Pandas Series Object-Constructing Series objects
- time series, Working with Time Series-Performance: When to Use These Functions
- vectorized string operations, Vectorized String Operations-Going further with recipes
- pandas.eval() function, pandas.eval() for Efficient Operations-Other operations
- Panel data, Data Aggregations on Multi-Indices
- partial slicing, Multiply indexed Series
- partitioning (partial sorts), Partial Sorts: Partitioning
- pasting code blocks, magic commands for, Pasting Code Blocks: %paste and %cpaste
- pd.concat() function
- pd.date_range() function, Regular sequences: pd.date_range()
- pd.eval() function, pandas.eval() for Efficient Operations-Other operations
- pd.merge() function, Combining Datasets: Merge and Join-Example: US States Data
- pdb (Python debugger), Debugging: When Reading Tracebacks Is Not Enough
- Perez, Fernando, IPython: Beyond Normal Python, Visualization with Matplotlib
- Period type, Pandas Time Series Data Structures
- perspective projections, Perspective projections
- pipelines, Validation curves in Scikit-Learn, Feature Pipelines
- pivot tables, Pivot Tables-Further data exploration
- Planets dataset
- plot legends
- Plotly, Other Python Graphics Libraries
- plotting
- axes limits for simple line plots, Adjusting the Plot: Axes Limits-Adjusting the Plot: Axes Limits
- bar plots, Bar plots
- changing defaults via rcParams, Changing the Defaults: rcParams
- colorbars, Customizing Colorbars-Example: Handwritten Digits
- data on maps, Plotting Data on Maps-Example: Exploring Marathon Finishing Times
- density and contour plots, Density and Contour Plots-Visualizing a Three-Dimensional Function
- display contexts, show() or No show()? How to Display Your Plots-Plotting from an IPython notebook
- factor plots, Factor plots
- from an IPython shell, Plotting from an IPython shell
- from script, Plotting from a script
- histograms, binnings, and density, Histograms, Binnings, and Density-Kernel density estimation
- IPython notebook, Plotting from an IPython notebook
- joint distributions, Joint distributions
- labeling simple line plots, Labeling Plots-Labeling Plots
- line colors and styles, Adjusting the Plot: Line Colors and Styles-Adjusting the Plot: Line Colors and Styles
- manual customization, Plot Customization by Hand-Plot Customization by Hand
- Matplotlib, Visualization with Matplotlib
- multiple subplots, Multiple Subplots-plt.GridSpec: More Complicated Arrangements
- of errors, Visualizing Errors-Continuous Errors
- pair plots, Pair plots
- plot legends, Customizing Plot Legends-Multiple Legends
- Seaborn, Visualization with Seaborn-Seaborn Versus Matplotlib
- simple line plots, Simple Line Plots-Labeling Plots
- simple scatter plots, Simple Scatter Plots-plot Versus scatter: A Note on Efficiency
- stylesheets for, Stylesheets-Seaborn style
- text and annotation for, Text and Annotation-Arrows and Annotation
- three-dimensional, Three-Dimensional Plotting in Matplotlib-Example: Visualizing a Möbius strip
- three-dimensional function, Visualizing a Three-Dimensional Function-Visualizing a Three-Dimensional Function
- ticks, Customizing Ticks-Summary of Formatters and Locators
- two-dimensional function, Plotting a two-dimensional function
- various Python graphics libraries, Other Python Graphics Libraries
- plt.axes() function, plt.axes: Subplots by Hand-plt.axes: Subplots by Hand
- plt.contour() function, Visualizing a Three-Dimensional Function-Visualizing a Three-Dimensional Function
- plt.GridSpec() function, plt.GridSpec: More Complicated Arrangements-plt.GridSpec: More Complicated Arrangements
- plt.imshow() function, Visualizing a Three-Dimensional Function-Visualizing a Three-Dimensional Function
- plt.legend() command, Customizing Plot Legends-Multiple Legends
- plt.plot() function
- plt.scatter() function
- plt.subplot() function, plt.subplot: Simple Grids of Subplots
- plt.subplots() function, plt.subplots: The Whole Grid in One Go
- polynomial basis functions, Polynomial basis functions
- polynomial regression model, Validation curves in Scikit-Learn
- pop() method, DataFrame as a dictionary
- population data, US, merge and join operations with, Example: US States Data-Example: US States Data
- principal axes, Introducing Principal Component Analysis-Introducing Principal Component Analysis
- principal component analysis (PCA), In Depth: Principal Component Analysis-General Machine Learning
- basics, Introducing Principal Component Analysis-PCA as Noise Filtering
- choosing number of components, Choosing the number of components
- eigenfaces example, Example: Eigenfaces-Example: Eigenfaces
- facial recognition example, Example: Eigenfaces-Example: Eigenfaces
- for dimensionality reduction, PCA as dimensionality reduction
- handwritten digit example, PCA for visualization: Handwritten digits-Choosing the number of components, PCA as Noise Filtering-PCA as Noise Filtering
- manifold learning vs., Some Thoughts on Manifold Methods
- meaning of components, What do the components mean?-What do the components mean?
- noise filtering, PCA as Noise Filtering-PCA as Noise Filtering
- strengths/weaknesses, Principal Component Analysis Summary
- visualization with, PCA for visualization: Handwritten digits
- profiling
- projections (see map projections)
- pseudo-cylindrical projections, Pseudo-cylindrical projections
- Python
R
- radial basis function, Beyond linear boundaries: Kernel SVM
- rainfall statistics, Example: Counting Rainy Days
- random forests
- RandomizedPCA, Example: Eigenfaces
- rcParams dictionary, changing defaults via, Changing the Defaults: rcParams
- RdBu colormap, Choosing the colormap
- record arrays, RecordArrays: Structured Arrays with a Twist
- reduce() method, Aggregates
- regression, Random Forest Regression-Summary of Random Forests
- (see also specific forms, e.g.: linear regression)
- regression task
- regular expressions, Methods using regular expressions
- regularization, Regularization-Lasso regularization ()
- relational algebra, Relational Algebra
- resample() method, Resampling and converting frequencies-Resampling and converting frequencies
- reset_index() method, Index setting and resetting
- reshaping, Reshaping of Arrays
- ridge regression (L2 regularization), Ridge regression ( regularization)
- right join, Specifying Set Arithmetic for Joins
- right_index keyword, The left_index and right_index keywords-The left_index and right_index keywords
- rolling statistics, Rolling windows
- runtime configuration (rc), Changing the Defaults: rcParams
S
- scatter plots (see simple scatter plots)
- Scikit-Learn package, Machine Learning, Data Representation in Scikit-Learn-Target array
- scipy.special submodule, Specialized ufuncs
- script
- Seaborn
- bar plots, Bar plots
- datasets and plot types, Exploring Seaborn Plots-Example: Exploring Marathon Finishing Times
- faceted histograms, Faceted histograms
- factor plots, Factor plots
- histograms, KDE, and densities, Histograms, KDE, and densities-Histograms, KDE, and densities
- joint distributions, Joint distributions
- marathon finishing times example, Example: Exploring Marathon Finishing Times-Example: Exploring Marathon Finishing Times
- Matplotlib vs., Visualization with Seaborn-Seaborn Versus Matplotlib
- pair plots, Pair plots
- stylesheet, Seaborn style
- visualization with, Visualization with Seaborn-Seaborn Versus Matplotlib
- Seattle, bicycle traffic prediction in
- Seattle, rainfall statistics in, Example: Counting Rainy Days
- semi-supervised learning, Categories of Machine Learning
- Series object (Pandas), The Pandas Series Object-Constructing Series objects
- as dictionary, Series as specialized dictionary, Series as dictionary
- constructing, Constructing Series objects
- data indexing/selection in, Data Selection in Series-Indexers: loc, iloc, and ix
- DataFrame as dictionary of, DataFrame as a dictionary-DataFrame as a dictionary
- DataFrame object constructed from, From a single Series object
- DataFrame object constructed from dictionary of, From a dictionary of Series objects
- generalized NumPy array, Series as generalized NumPy array
- hierarchical indexing in, A Multiply Indexed Series-Data Aggregations on Multi-Indices
- index alignment in, Index alignment in Series
- indexer attributes, Indexers: loc, iloc, and ix
- multiply indexed, Multiply indexed Series
- one-dimensional array, Series as one-dimensional array
- operations between DataFrame and, Ufuncs: Operations Between DataFrame and Series
- shell, IPython
- shift() function, Time-shifts-Time-shifts
- shortcuts
- simple histograms, Histograms, Binnings, and Density-Histograms, Binnings, and Density
- simple line plots
- simple linear regression, Simple Linear Regression-Simple Linear Regression
- simple scatter plots
- slice() operation, Vectorized item access and slicing
- slicing
- sorting arrays, Sorting Arrays-Example: k-Nearest Neighbors
- source code, accessing, Accessing Source Code with ??
- splitting arrays, Splitting of arrays
- string operations (see vectorized string operations)
- structured arrays, Structured Data: NumPy’s Structured Arrays-RecordArrays: Structured Arrays with a Twist
- stylesheets
- subarrays
- subplots
- subsets, faceted histograms, Faceted histograms
- suffixes keyword, Overlapping Column Names: The suffixes Keyword
- supervised learning, Categories of Machine Learning
- support vector (defined), Fitting a support vector machine
- support vector classifier, Fitting a support vector machine-Fitting a support vector machine
- support vector machines (SVMs), In-Depth: Support Vector Machines
- surface plots, three-dimensional, Wireframes and Surface Plots-Example: Visualizing a Möbius strip
T
- t-distributed stochastic neighbor embedding (t-SNE), Some Thoughts on Manifold Methods, Example 1: k-Means on digits
- tab completion
- table, data as, Data as table
- target array, Target array-Target array
- term frequency-inverse document frequency (TF-IDF), Text Features
- text, Text Features
- text entry shortcuts, Text Entry Shortcuts
- three-dimensional plotting
- ticks (tick marks)
- Tikhonov regularization, Ridge regression ( regularization)
- time series
- bar plots, Bar plots
- dates and times in Pandas, Dates and times in Pandas: Best of both worlds
- datetime64, Typed arrays of times: NumPy’s datetime64
- frequency codes, Frequencies and Offsets
- indexing data by timestamps, Pandas Time Series: Indexing by Time
- native Python dates and times, Native Python dates and times: datetime and dateutil
- offsets, Frequencies and Offsets
- Pandas, Working with Time Series-High-Performance Pandas: eval() and query()
- Pandas data structures for, Pandas Time Series Data Structures-Regular sequences: pd.date_range()
- pd.date_range(), Regular sequences: pd.date_range()
- Python vs. Pandas, Dates and Times in Python-Dates and times in Pandas: Best of both worlds
- resampling and converting frequencies, Resampling and converting frequencies-Resampling and converting frequencies
- rolling statistics, Rolling windows
- Seattle bicycle counts example, Example: Visualizing Seattle Bicycle Counts-High-Performance Pandas: eval() and query()
- time-shifts, Time-shifts-Time-shifts
- typed arrays, Typed arrays of times: NumPy’s datetime64
- Timedelta type, Pandas Time Series Data Structures
- Timestamp type, Pandas Time Series Data Structures
- timestamps, indexing data by, Pandas Time Series: Indexing by Time
- timing, of code, Timing Code Execution: %timeit, Timing Code Snippets: %timeit and %time-Timing Code Snippets: %timeit and %time
- transform() method, Transformation
- transforms
- triangulated surface plots, Surface Triangulations-Example: Visualizing a Möbius strip
- trigonometric functions, Trigonometric functions
- tshift() function, Time-shifts-Time-shifts
- two-fold cross-validation, Model validation via cross-validation
U
- ufuncs (see universal functions)
- unary ufuncs, Exploring NumPy’s UFuncs
- underfitting, The bias–variance trade-off, Learning Curves
- underscore (_) shortcut, Underscore Shortcuts and Previous Outputs
- universal functions (ufuncs), Computation on NumPy Arrays: Universal Functions-Ufuncs: Learning More
- absolute value, Absolute value
- advanced features, Advanced Ufunc Features
- aggregates, Aggregates
- array arithmetic, Array arithmetic
- basics, Introducing UFuncs
- comparison operators as, Comparison Operators as ufuncs-Comparison Operators as ufuncs
- exponentials, Exponents and logarithms
- index alignment, UFuncs: Index Alignment-Index alignment in DataFrame
- index preservation, Ufuncs: Index Preservation
- logarithms, Exponents and logarithms
- operating on data in Pandas, Operating on Data in Pandas-Filling null values
- operations between DataFrame and Series, Ufuncs: Operations Between DataFrame and Series
- outer products, Outer products
- slowness of Python loops, The Slowness of Loops
- specialized ufuncs, Specialized ufuncs
- specifying output, Specifying output
- trigonometric functions, Trigonometric functions
- unstack() method, MultiIndex as extra dimension
- unsupervised learning