Confusion Matrix Calculator

Enter confusion matrix values (TP, FP, FN, TN) to instantly calculate all classification metrics — accuracy, precision, recall, F1, MCC, and more. Upload a CSV for automatic matrix generation. All calculations run locally in your browser.

Confusion Matrix Values

True Positive (TP): correctly predicted positive
False Negative (FN): missed positive (Type II error)
False Positive (FP): wrong positive (Type I error)
True Negative (TN): correctly predicted negative

Total samples: 200

Confusion Matrix

                  Predicted Pos    Predicted Neg
Actual Pos        45.0% (TP)       10.0% (FN)
Actual Neg         5.0% (FP)       40.0% (TN)

Key Metrics

Total samples: 200 · Positives: 110 · Negatives: 90

Metric	Value	Meaning
Accuracy	85.00%	Fraction of all predictions that were correct.
Precision	90.00%	Of all predicted positives, how many were correct?
Recall	81.82%	Of all actual positives, how many were found?
Specificity	88.89%	Of all actual negatives, how many were correctly identified?
F1 Score	85.71%	Harmonic mean of precision and recall.
FPR	11.11%	False Positive Rate (1 − Specificity).
FNR	18.18%	False Negative Rate (1 − Recall).
NPV	80.00%	Negative Predictive Value: accuracy of negative predictions.
Balanced Accuracy	85.35%	(Recall + Specificity) / 2. Fair for imbalanced classes.
MCC	0.7035	Matthews Correlation Coefficient (−1 to +1).

What Is a Confusion Matrix?

A confusion matrix is a table that summarises the performance of a classification model by comparing actual labels against predicted labels. For binary classification, it contains four values: True Positives (TP), True Negatives (TN), False Positives (FP), and False Negatives (FN).

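From these four numbers you can derive the standard threshold-based classification metrics (accuracy, precision, recall, F1 score, specificity, MCC, and more), giving you a picture of how well a model performs across both positive and negative classes. Metrics that depend on predicted scores rather than final labels, such as ROC AUC or log loss, cannot be recovered from a single confusion matrix.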

Confusion Matrix Layout

                  Predicted Positive    Predicted Negative
Actual Positive        TP (True+)            FN (False-)
Actual Negative        FP (False+)           TN (True-)

TP = Correctly predicted positive
TN = Correctly predicted negative
FP = Incorrectly predicted positive (Type I error)
FN = Incorrectly predicted negative (Type II error)
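
The four cells can be counted directly from paired labels. The following is a minimal sketch, assuming binary labels encoded as 1 (positive) and 0 (negative); the function name `confusion_counts` and the sample lists are hypothetical and only illustrate the layout above.

```python
from collections import Counter

def confusion_counts(actual, predicted, positive=1):
    """Count TP, FN, FP, TN for binary labels; `positive` marks the positive class."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    counts = Counter()
    for a, p in zip(actual, predicted):
        if a == positive:
            # Actual positive row: either found (TP) or missed (FN).
            counts["TP" if p == positive else "FN"] += 1
        else:
            # Actual negative row: either a false alarm (FP) or correct (TN).
            counts["FP" if p == positive else "TN"] += 1
    return {key: counts[key] for key in ("TP", "FN", "FP", "TN")}

# Hypothetical sample data, not taken from the calculator.
actual    = [1, 1, 1, 0, 0, 0, 1, 0]
predicted = [1, 0, 1, 0, 1, 0, 1, 0]
cells = confusion_counts(actual, predicted)
print(cells)
```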

All Metric Formulas

Accuracy          = (TP + TN) / (TP + FP + FN + TN)
Precision         = TP / (TP + FP)
Recall            = TP / (TP + FN)
Specificity       = TN / (TN + FP)
F1 Score          = 2 × (Precision × Recall) / (Precision + Recall)
False Positive Rate (FPR) = FP / (FP + TN)
False Negative Rate (FNR) = FN / (FN + TP)
NPV               = TN / (TN + FN)
Balanced Accuracy = (Recall + Specificity) / 2
MCC               = (TP×TN − FP×FN) / √((TP+FP)(TP+FN)(TN+FP)(TN+FN))

Example: TP=90, TN=80, FP=10, FN=20
  Accuracy   = (90 + 80) / 200 = 85.00%
  Precision  = 90 / 100        = 90.00%
  Recall     = 90 / 110        = 81.82%
  F1 Score   = 2×(0.9000×0.8182)/(0.9000+0.8182) ≈ 85.71% (exactly 180/210)
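
The formulas above translate almost line for line into code. This is a sketch rather than the calculator's actual implementation: the names `classification_metrics` and `safe_div` are illustrative, and returning NaN when a denominator is zero is an assumption about how undefined metrics might be handled. F1 is computed as 2TP / (2TP + FP + FN), which is algebraically the same as the harmonic-mean form listed above.

```python
import math

def safe_div(num, den):
    """Return num / den, or NaN when the denominator is zero (assumed convention)."""
    return num / den if den else float("nan")

def classification_metrics(tp, fp, fn, tn):
    """Derive the metrics listed above from the four confusion matrix cells."""
    recall = safe_div(tp, tp + fn)
    specificity = safe_div(tn, tn + fp)
    mcc_den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": safe_div(tp + tn, tp + fp + fn + tn),
        "precision": safe_div(tp, tp + fp),
        "recall": recall,
        "specificity": specificity,
        # Equivalent to 2 * P * R / (P + R), without intermediate rounding.
        "f1": safe_div(2 * tp, 2 * tp + fp + fn),
        "fpr": safe_div(fp, fp + tn),
        "fnr": safe_div(fn, fn + tp),
        "npv": safe_div(tn, tn + fn),
        "balanced_accuracy": (recall + specificity) / 2,
        "mcc": safe_div(tp * tn - fp * fn, mcc_den),
    }

# Worked example from the text: TP=90, TN=80, FP=10, FN=20.
m = classification_metrics(tp=90, fp=10, fn=20, tn=80)
for name, value in m.items():
    print(f"{name:18s} {value:.4f}")
```

Working from the raw counts avoids the small rounding drift that appears when precision and recall are rounded before being combined.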

Metric Reference Guide

Metric	Formula	Best For
Accuracy	(TP+TN)/Total	Balanced datasets
Precision	TP/(TP+FP)	When FP is costly (e.g. spam)
Recall	TP/(TP+FN)	When FN is costly (e.g. medical)
Specificity	TN/(TN+FP)	Negative class performance
F1 Score	2×P×R/(P+R)	Imbalanced datasets
FPR	FP/(FP+TN)	ROC curve analysis
FNR	FN/(FN+TP)	Miss rate analysis
NPV	TN/(TN+FN)	Negative prediction reliability
Balanced Accuracy	(Recall+Specificity)/2	Class-imbalanced evaluation
MCC	(TP×TN−FP×FN)/√(…)	Overall quality (−1 to +1)

When to Use Each Metric

Use Accuracy when…

Your dataset is balanced (roughly equal class sizes) and all misclassification types carry equal cost.

Use Precision when…

False positives are expensive. In spam detection, flagging a legitimate email as spam (FP) damages user trust more than missing a spam email.

Use Recall when…

False negatives are dangerous. In cancer screening, missing a true positive (FN) has far greater consequences than a false alarm.

Use F1 Score when…

Your dataset is imbalanced and you need a single metric that balances precision and recall equally.

Use MCC when…

You want a single comprehensive metric that accounts for all four quadrants of the confusion matrix, especially for heavily imbalanced datasets.

Frequently Asked Questions

What does a confusion matrix tell you?

It breaks down all prediction outcomes into four categories (TP, TN, FP, FN), letting you see exactly where your model succeeds and where it fails. From these you can calculate the label-based classification metrics, such as accuracy, precision, recall, and MCC, without needing the raw predictions.

What is a good accuracy for a classification model?

It depends heavily on the dataset and the task. For balanced datasets, 90%+ is often treated as good, but the result should always be compared against a simple baseline. For heavily imbalanced datasets, accuracy can be misleading: if 99% of samples belong to one class, a model that always predicts that class scores 99% accuracy while never detecting the minority class.
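
A quick calculation makes the imbalance problem concrete. The counts below are hypothetical: 1,000 samples with 10 positives, and a model that predicts negative for every sample.

```python
# Hypothetical dataset: 10 positives, 990 negatives.
# The model predicts "negative" for everything, so TP = FP = 0.
tp, fn = 0, 10
fp, tn = 0, 990

accuracy = (tp + tn) / (tp + fp + fn + tn)
recall = tp / (tp + fn)            # share of actual positives found
specificity = tn / (tn + fp)       # share of actual negatives identified
balanced_accuracy = (recall + specificity) / 2

print(f"accuracy          = {accuracy:.2%}")
print(f"recall            = {recall:.2%}")
print(f"balanced accuracy = {balanced_accuracy:.2%}")
```

Accuracy looks excellent, while recall and balanced accuracy show that the model never finds a positive case.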

What is the difference between sensitivity and specificity?

Sensitivity (recall) measures how well the model finds actual positives. Specificity measures how well it correctly identifies actual negatives. A good diagnostic test aims for high values of both.

Why is MCC considered a better metric than F1?

MCC (Matthews Correlation Coefficient) uses all four values of the confusion matrix and is not inflated by class imbalance. F1 ignores true negatives entirely. For datasets where TN is large (common in fraud detection), MCC gives a more balanced assessment.
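
One way to see the difference is to swap which class is called "positive" and recompute both metrics. The sketch below uses hypothetical counts; `f1` and `mcc` are illustrative helper names. Because F1 does not use TN, its value changes sharply under the swap, whereas MCC treats both classes symmetrically and stays the same.

```python
import math

def f1(tp, fp, fn, tn):
    # TN is accepted for a uniform signature but is not used by F1.
    return 2 * tp / (2 * tp + fp + fn)

def mcc(tp, fp, fn, tn):
    den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / den

# Hypothetical counts where the positive class is the majority.
original = dict(tp=90, fp=4, fn=5, tn=1)
# Same predictions with the class labels swapped:
# old TP becomes TN, old TN becomes TP, old FP becomes FN, old FN becomes FP.
swapped = dict(tp=1, fp=5, fn=4, tn=90)

f1_original, f1_swapped = f1(**original), f1(**swapped)
mcc_original, mcc_swapped = mcc(**original), mcc(**swapped)

print(f"F1:  {f1_original:.4f} vs {f1_swapped:.4f}")
print(f"MCC: {mcc_original:.4f} vs {mcc_swapped:.4f}")
```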

Can I upload predictions directly?

Yes. Use the CSV Upload mode and provide a two-column CSV with columns 'actual' and 'predicted'. The calculator parses binary values (1/0) or string labels (positive/negative, yes/no) and builds the confusion matrix automatically.
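
For readers who want to reproduce this step offline, here is a minimal sketch of how such a CSV could be turned into a confusion matrix. It is not the calculator's own code: the function names, the case-insensitive label matching, and the choice to raise an error on unknown labels are all assumptions. Only the column names and accepted label values come from the description above.

```python
import csv
import io

# Label spellings mentioned above; matching is case-insensitive by assumption.
POSITIVE = {"1", "positive", "yes"}
NEGATIVE = {"0", "negative", "no"}

def to_binary(value):
    """Map a raw CSV cell to 1 (positive) or 0 (negative)."""
    token = value.strip().lower()
    if token in POSITIVE:
        return 1
    if token in NEGATIVE:
        return 0
    raise ValueError(f"unrecognised label: {value!r}")

def matrix_from_csv(text):
    """Build TP/FN/FP/TN counts from CSV text with 'actual' and 'predicted' columns."""
    cells = {"TP": 0, "FN": 0, "FP": 0, "TN": 0}
    for row in csv.DictReader(io.StringIO(text)):
        a = to_binary(row["actual"])
        p = to_binary(row["predicted"])
        key = ("TP" if p else "FN") if a else ("FP" if p else "TN")
        cells[key] += 1
    return cells

# Hypothetical sample file contents mixing the accepted label styles.
sample = """actual,predicted
1,1
yes,no
0,0
negative,positive
positive,positive
"""
result = matrix_from_csv(sample)
print(result)
```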
