HDDS Data Science – HD Data Science

Objective 1

Evaluating generalized linear models

Diagnostic evaluation of multivariate least-squares regressions
Influential outlier detection
Interpreting coefficients, marginal effects, and odds-ratios

Objective 2

Machine Learning (Supervised)

Performance evaluation: overfitting, scoring best practices, impacts on business and strategic decisions
Cross-validation procedure, parameter tuning, regularization
Ensemble approaches: bagging, boosting, and stacking models
Model explainability and feature effects

Objective 3

Natural Language Processing

Regular expressions and tokenization
Categorization and tagging words: corpora, n-gram, and transformation-based tagging
Context-free, dependency, and feature based grammars
Word & sentence embeddings

Objective 4

Machine Learning (Unsupervised)

Factor Analysis & PCA: dimension selection, rotation effects, variance contribution
Cluster analysis: model advantages and limitations, use case feasibility, interpretation
Cluster analysis: evaluation metrics

Objective 5

Time Series & Forecasting

ARIMA models, diagnostics, and evaluation
Autocorrelation plots for lag selection, detection of stationarity
Unit-root tests
Forecast metrics and model selection

Objective 6

Deep Learning

Keras framework and operation
Deep Neural Networks – CNN, LSTM, and more
Transfer learning and pre-trained models
OpenCV and open-source computer vision models for object detection, facial recognition, and image processing

Struggling to assess your current and incoming data professionals?