
Cohen's D benchmarks: 0.2 = small, 0.5 = medium, 0.8 = large effect
Image: Tovarg, CC BY-SA 4.0, via Wikimedia Commons
Cohen's D benchmarks: 0.2 = small, 0.5 = medium, 0.8 = large effect
Effect size is a quantitative measure of the magnitude of a phenomenon in statistics. It can refer to the value of a statistic calculated from a sample of data, the value of one parameter for a hypothetical population, or the equation that operationalizes how statistics or parameters lead to the effect size value. Examples of effect sizes include the correlation between two variables, the regression coefficient in a regression, the mean difference, and the risk of a particular event (such as a heart attack).
Effect sizes are a complementary tool for statistical hypothesis testing and play an important role in statistical power analyses to assess the sample size required for new experiments. They are also fundamental to meta-analysis, which aims to provide the combined effect size based on data from multiple studies. Effect size calculations are essential for evaluating the strength of a statistical claim and are the first item in the MAGIC criteria.
The standard deviation of the effect size is of critical importance, as it indicates how much uncertainty there is in the effect size calculation. This standard deviation helps in understanding the precision of the effect size and its reliability in different studies or experiments.
Example
In a study measuring the effectiveness of a new drug, the effect size (Cohen's D) was found to be 0.5, indicating a medium effect. This means that the drug has a moderate impact on the outcome being measured.
Understanding Cohen's D benchmarks helps researchers and statisticians interpret the magnitude of effect sizes in their studies, ensuring accurate and meaningful conclusions.
batch size affects generalization: larger batches find sharper minima
Larger batch sizes lead to sharper minima, enhancing generalization by providing more accurate gradient estimates
L1 vs L2 regularization: L1 gives sparsity (feature selection), L2 gives small weights
L1 regularization: L1 = L2 + sparsity; L2 regularization: L2 = L1 + small weights
quantization to INT8 doubles throughput
Quantization to INT8 doubles throughput because tensor cores process INT8 2x faster
BFS vs DFS: BFS finds shortest path in unweighted graphs, DFS uses less memory
BFS finds shortest path in unweighted graphs; DFS uses less memory
Bias vs variance: high bias = underfitting, high variance = overfitting
High bias = underfitting, high variance = overfitting
the vocabulary size matters: larger vocab = shorter sequences but more parameters
Larger vocab reduces sequence length, increasing model complexity and parameters
One email a day: 5 concepts + the 5 stories that matter →
Swipe through 100 ML concepts daily
Open TickerNews