Data Science

Data Science

A statistical method used to compare two versions of a product or feature to determine which one performs better.

An unsupervised learning technique that groups similar data points together based on shared features.

A statistical measure that indicates the extent to which two variables fluctuate together.

A technique for assessing how the results of a statistical model will generalize to an independent dataset.

The process of detecting and correcting (or removing) inaccurate records from a dataset.

The process of discovering patterns and relationships in large datasets using statistical and machine learning techniques.

Summary statistics that quantitatively describe features of a dataset, such as mean, median, and standard deviation.

The process of selecting, modifying, or creating new variables to improve the performance of a machine learning model.

A method of making decisions or inferences about population parameters based on sample data.

A predictive modeling technique used to model the relationship between a dependent variable and one or more independent variables.

The process of assessing how well a predictive model performs, using metrics like accuracy, precision, recall, and F1-score.

A modeling error that occurs when a model learns noise in the training data rather than the actual patterns, reducing generalizability.

A statistical metric used to determine the significance of results obtained in hypothesis testing.

Using historical data, statistical algorithms, and machine learning to identify the likelihood of future outcomes.

A type of machine learning where the model is trained on labeled data to make predictions or classifications.

Want to explore more? Stay tuned for new terms and updates!

Last updated on 26 May, 2025