
Unstructured Data - OpenCV, Image Processing, Smoothing, Morphological Operations, NLTK, Text Processing.
Data from Web APIs, Scraping, Data Cleaning.
Relational and Non-relational Databases, ER Diagrams, SQL Commands, Aggregate Functions, Joins, Subqueries, Normalisation, Scaling Patterns, ACID, Dask SQL, Cloud SQL (Athena/BigQuery).
Vectors and Matrices, Unit Vector, Dot Product, Projections, Cosine Similarity, Determinant, Transpose, Systems of Equations.
PDF, PMF, CDF, PPF, Uniform, Gaussian (Normal), Bernoulli, Multinomial, Poisson, Exponential, Geometric, Log-normal, and Pareto/Power Law Distributions.
Combinatorics, Marginal Probability, Joint Probability, Conditional Probability, Bayes' Theorem, Mean, Median, Mode, Percentile, IQR, Outliers.
Probability Theory and Descriptive Statistics.
Data Visualisation using Matplotlib and Seaborn.