AdaBoosting process
dataset
iteration 1
iteration 2
iteration 3
vs. stand-alone decision tree model
steps
weak classification models
Agglomerative clustering
Analytics
categorization
descriptive analytics
diagnostics
predictions/estimations
prescriptive
types
Artificial general intelligence (AGI)
Artificial intelligence (AI)
analytics
data mining
data science
evolution
AGI
ANI
ASI
data analytics
data mining
data science
definition
statistics
statistics
Bayesian
frequentist
regression
statistics vs. data mining vs. data analytics vs. data science
Artificial narrow intelligence (ANI)
Artificial neural network (ANN)
activation function
autoencoders
biological neurons
CNN
CNN and MNIST dataset
correponding array
deep learning
handwritten digit(zero)image
hidden_layer_sizes
image classification
learning_rate_init
load MNIST data
LSTM
max_iter
MLP and Keras
MLP (feedforward network)
multilayer perceptron representation
multilayer perceptrons (feedforward neural network)
perceptron
RBM algorithm
reinforcement learning
scikit-learn MLP
solver
transfer learning
visual challenges
visualization
visual pathway
Artificial super intelligence (ASI)
Autocorrelation function (ACF)
Autoencoders
de-noise image
dimension reduction
elements
Autoregressive integrated moving average (ARIMA)
AM and MA
autocorrelation
build model and evaluate
check stationary
decompose time series
model
predict function
predictors
Autoregressive model (AM)
Average silhouette method
Bagging
Bag of words (BoW)
Bayesian statistics
Biological vs. artificial neuron
Bootstrapaggregation
Clustering
hierarchicalcluster
K-means
accuracy
average silhouette method
elbow method
expectation maximization
limitations
methods
text
k-means
LSA
singular value decomposition
source code
SVD
Collaborative filtering (CF)
Command-line installer
Convolution neural network (CNN)
Cross-industry standard process for data mining (CRISP-DM)
business
data gaps/relevance
deployment
evaluation
framework and phases
modeling
preparation
process diagram
Data assemble (text)
dataframe
get access key
pdf, jpg, and audio file
social media
textract formats
twitter authentication
DataFrame
Data mining
KDD
techniques
Data preprocessing (text)
bag of words
lemmatization
lower() function
n-grams
PoS tagging
removing noise
sentence tokenization
stemming
TF-IDF
word tokenization
Data science
Deep learning
ANN
Caffe
Keras
Lasagne
libraries
MXNet
Pylearn2
TensorFlow
Theano
Deep natural language processing (DNLP)
sopex package
Word2Vec
Descriptive analytics
Diagnostic analytics
dir() operation code
Document term matrix (DTM)
Elbow method
Ensemble methods
bagging
decision boundaries
ExtraTree
feature importance function
RandomForest
types of
Enterprise resource planning (ERP) systems
Exception handling
code flow
file operations
Python built-in
source code
try clause
Exploration (text)
co-occurrence matrix
frequency chart
lexical dispersion plot
word cloud
Exploratory data analysis (EDA)
Iris dataset
multivariate analysis
code creation
correlation matrix
findings values
pair plot
pandas dataframe visualization
univariate analysis
Extremely randomized trees (ExtraTree)
Feature engineering
construction/generation
handling categorical data
dummy variable creation
number conversion
logical flow
missing data
normalization and scaling
raw data
summarization methods
Generalized linear models (GLM)
Global positioning system (GPS)
Gradient boosting
GridSearch
Hard voting vs. soft voting
Hierarchical cluster technique
key parameters
maximum linkage
source code
Hyperparameter tuning
approach
GridSearch
RandomSearch
Identity operators
Input/output file
opening mode
operations
sequence
Join statement
inner
left
outer
right
K-folds cross-validation
classification model
holdout/single fold method
stratification
k nearest neighbors (kNN)
Knowledge discovery databases (KDD)
data mining
data mining process flow
interpretation/evaluation
preprocessing and cleaning
selection
stages
transformation techniques
Latent Dirichlet Allocation (LDA)
Latent semantic analysis (LSA)
Lemmatization
Linear regression vs. logistic regression
Logistic regression
GLM distribution
load data
model training and evaluation
multi-classes
normalize data
split data
Long short-term memory (LSTM)
Machine learning
AI
AI evolution
areas
categorization
CRISP-DM
data
attributes
comparison
continuous or quantitative
discrete/qualitative
fact and figures
interval scale
measurement scales
nominal level
ordinal scale
ratio scale
definitions
EDA
feature engineering
frameworks
history
KDD
libraries
ML imposters
overview
pattern recognition
process loop
prospect customer identification
Python
questions/hypothesis
recommendation system
regression
reinforcement
resources
robotic intelligent"
scikit-learn
SEMMA
simple models
spam detection
statsmodels
supervised learning
classification
regression
Turing test
unsupervised learning
clustering
dimension reduction
wheels from scratch
Matplotlib
Mean absolute error
Model building, text similarity
Model diagnosis and tuning
attributes
bias and variance
boosting
AdaBoosting process
ensemble voting
essential tuning parameters
gradient boosting
illustration
sklearn wrapper
stacking
xgboost
ensemble methods
bagging
decision boundaries
ExtraTree
feature importance function
RandomForest
types of
hyperparameter
k-fold cross-validation
probability cutoff point
class distribution
error message
functions
logistic regression model
optimal cutoff point
rare event/imbalanced dataset
disadvantages
handling techniques
make_classification function
re-sampling
variance
Moving average (MA)
Multivariate linear regression model
Multivariate regression
housing dataset (RDatasets)
multicollinearity and VIF
regression diagnostics
homoscedasticity test
linearity check
model fittings
outliers
over-fitting
under-fitting
Natural language processing (NLP)
N-grams
Nonlinear regression
Non-negative matrix factorization (NMF)
NumPy
arrays
broadcasting
built-in functions
indexing
boolean
field access
integer
slice syntax
types
mathematical functions
array math
sum function
transpose function
types
Object-oriented
bar plots–ax.bar() and ax.barh()
colomaps reference
customization
grid creation
horizontal bar charts
line plots–ax.plot()
line style and marker style
marker reference
matplotlib line style reference
multiple lines-different axis
multiple lines-same axis
pie chart–ax.pie()
plotting defaults
side-by-side bar chart
stacked bar charts
Pandas
DataFrame
data structures
grouping operation
join
merge/join
operations
pivot tables
reading and writing data
SQL/excel/R data frames
statistics
view function
Partial autocorrelation function (PACF)
Part of speech (PoS) tagging
Polynomial regression
Predictive analytics
Prescriptive analytics
Principal component analysis (PCA)
Problem types vs. potential ML algorithms
Python
code blocks
correct indentation
incorrect indentation
indentation
suites
control structure
iteration
loop control statement
selection statements
definition
dictionary
exception handling
file input/output
identifier
interactive
keywords
lists
module
mottos
multiline statements
NumPy and Pandas
object types
comments
list vs. tuple vs. set vs. dictionary
multiline comments
single line
operators
arithmetic operators
assignment operators
bitwise operators
comparison/relational operators
identity operators
logical operators
membership operators
types
vs. others
popular coding language
sets
tuple
accessing tuple
deleting items
operations
user-defined functions
2.7/3.4.x
Anaconda
graphical installer
Linux installation
official website
OSX installation
run command line
version
Windows installation
Python packages
customizing labels
data analysis
global functions
key packages
Matplotlib
NumPy
Pandas
libraries
object oriented
RandomForest
RandomSearch
Recommender systems
collaborative filtering (CF)
content-based filtering
types
Recurrent neural network (RNN)
Regression analysis
Regularization
Reinforcement learning
Restricted Boltzman Machines (RBM)
Robotic intelligent agent
components
definition
sensors and effectors
Turing test
Root mean squared error (RMSE)
Sample, explore, modify, model, assess (SEMMA)
assess
CRISP-DM and KDD
explore
frameworks
modeling/data mining
modify
sample
Sentiment analysis
Sets
accessing set elements
changing elements
code creation
difference
discard()/remove() method
intersection
key characteristics
operations
symmetric_difference()method
union
Stacking
Stochastic gradient descent algorithm
Supervised learning
cases
classification
confusion matrix
correlation and causation
decision trees
key parameters
model
nodes
splits and grows
stopping partition
fitting line
k nearest neighbors (kNN)
linear regression model
mean absolute error
metrics
RMSE
R-squared metrics
logistic regression
model performance classification
multiclass logistic regression
multivariate regression
coefficient
Durbin-Watson statistics
housing dataset (RDatasets)
hypothesis testing steps
multicollinearity and VIF
normal distribution
OLS regression results
regression diagnostics
results
R-squared value
standard error
t and p-value
nonlinear regression
plot sigmoid function
polynomial regression
process flow
regularization
ROC curve
scatter plot
slope line fitting
stochastic gradient descent
students score vs. hours
SVM
time-series forecasting
ARIMA
components
stationary time series
under-fitting, right-fitting, and over-fitting
Supervised learning algorithms
classification
regression
Support vector machine (SVM)
decision boundaries
equation
key objective
key parameters
symmetric_difference()method
Term document matrix (TDM)
Term frequency-inverse document frequency (TF-IDF)
Text mining
data assemble
datapreprocessing
DNLP
exploration
co-occurrence matrix
frequency chart
lexical dispersion plot
word cloud
libraries
model building
classification
clustering
document term matrix
Euclidian vs. cosine
sentiment analysis
text similarity
topic modeling
phases
process overview
Time-series forecasting
ARIMA
components
stationary time series
Transfer learning
Tuple, operations
Turing test
Unsupervised learning
clustering
PCA
process flow
User-defined functions
default argument
definition
functions with arguments
functions without argument
**kwargs
length arguments
passing argumens (*args)
variable/identifier
Word2Vec
Xgboost (eXtreme gradient boosting)