recall_score: Classification Recall Metric¶

The recall_score function computes the recall of classification models, supporting both binary and multiclass settings. Recall measures the proportion of actual positives that were correctly identified, making it a key metric for evaluating classifiers, especially when the cost of false negatives is high.

Overview¶

Recall is defined as the ratio of true positives (TP) to the sum of true positives and false negatives (FN).

This metric is especially important in scenarios where false negatives are more costly than false positives (e.g., disease screening, fraud detection).

Parameters¶

Parameter	Type	Default	Description
`y_true`	array-like or pandas Series	—	Ground truth (correct) target values. Shape: (n_samples,)
`y_pred`	array-like or pandas Series	—	Estimated target values as returned by a classifier. Shape: (n_samples,)
`average`	{'binary', 'micro', 'macro', 'weighted', None}	'binary'	Determines the type of averaging performed on the data. See below for details.
`labels`	array-like or None	None	List of labels to include. If None, uses sorted unique labels from y_true and y_pred.
`verbose`	bool	True	Print detailed progress, warnings, and results

Averaging Options¶

'binary': Only report results for the positive class (default for binary classification).
'micro': Calculate metrics globally by counting the total true positives, false negatives, and false positives.
'macro': Calculate metrics for each label, and find their unweighted mean.
'weighted': Calculate metrics for each label, and find their average weighted by support (the number of true instances for each label).
None: Return the recall for each class as an array.

Returns¶

recall: float or array of floats Recall score(s). Returns a float if average is not None, otherwise returns an array of recall values for each class.

Raises¶

ValueError
If y_true or y_pred is a pandas DataFrame (must select a column).
If the shapes of y_true and y_pred do not match.
If average='binary' but the problem is not binary classification.
If average is not a recognized option.

Example Usage¶

from machinegnostics.metrics import recall_score

# Example 1: Macro-averaged recall for multiclass
y_true = [0, 1, 2, 2, 0]
y_pred = [0, 0, 2, 2, 0]
print(recall_score(y_true, y_pred, average='macro'))

# Example 2: Binary recall with pandas Series
import pandas as pd
df = pd.DataFrame({'true': [1, 0, 1], 'pred': [1, 1, 1]})
print(recall_score(df['true'], df['pred'], average='binary'))

Notes¶

The function supports input as numpy arrays, lists, or pandas Series.
If you pass a pandas DataFrame, you must select a column (e.g., df['col']), not the whole DataFrame.
For binary classification, by convention, the second label is treated as the positive class.
For imbalanced datasets, consider using average='weighted' to account for class support.