Word Error Rate (WER) measures the edit distance between predicted and reference transcriptions
Image: CC BY-SA 3.0, via Wikimedia Commons
Word Error Rate (WER) measures the edit distance between predicted and reference transcriptions
BLEU vs ROUGE: BLEU measures precision of n-grams, ROUGE measures recall
BLEU measures precision of n-grams, ROUGE measures recall
the vocabulary size matters: larger vocab = shorter sequences but more parameters
Larger vocab reduces sequence length, increasing model complexity and parameters
Brier score
Brier score measures mean squared error of probability predictions
WordPiece tokenization does: similar to BPE but uses likelihood instead of frequency
WordPiece tokenization splits words into subwords based on token likelihood rather than frequency
log-loss / cross-entropy loss penalizes: confident wrong predictions more heavily
Log-loss penalizes confident incorrect predictions more heavily
to normalize features: when features have different scales and you use distance-based methods
Normalize features when they have different scales for distance-based methods
One email a day: 5 concepts + the 5 stories that matter →
Swipe through 100 ML concepts daily
Open TickerNews