File: paper.md

package info (click to toggle)
python-pycm 4.3-1
  • links: PTS, VCS
  • area: main
  • in suites: trixie
  • size: 3,952 kB
  • sloc: python: 5,029; sh: 8; makefile: 6
file content (56 lines) | stat: -rw-r--r-- 2,107 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
---
title: 'PyCM: Multiclass confusion matrix library in Python'
tags:
  - confusion-matrix
  - classification
  - statistics
  - statistical-analysis
  - analysis
  - machine-learning
  - data-analysis
  - python
authors:
 - name: Sepand Haghighi
   orcid: 0000-0001-9450-2375
   affiliation: 1
 - name: Masoomeh Jasemi
   orcid: 0000-0002-4831-1698
   affiliation: 1
 - name: Shaahin Hessabi
   orcid: 0000-0003-3193-2567
   affiliation: 1
 - name: Alireza Zolanvari
   orcid: 0000-0003-2367-8343
   affiliation: 2
affiliations:
 - name: Sharif University of Technology
   index: 1
 - name: Amirkabir University of Technology
   index: 2

date: 29 April 2018
bibliography: paper.bib
---
						

# Summary			
			
In the field of machine learning and specifically for statistical classification a confusion matrix - also known as error matrix - is a specific table layout that allows visualization of the algorithm performance, and is mostly used in supervised learning. Each row of the matrix represents the instances in an actual class, while each column represents the instances in a predict class (or vice versa)[@Powers2011; @Sammut2010; @Shepperd2014; @Deng2016].			
			

PyCM is a multi-class confusion matrix library written in Python that supports both input data vectors and direct matrix, and a proper tool for post-classification model evaluation that supports most classes and overall statistics parameters [@Landis1977; @Fleiss1971; @Altman1990; @Gwet2008; @Scott1955; @Bennett1954; @Cicchetti1994; @Davies1980; @Kullback1951; @Goodman1972; @Goodman1963; @Byrt1993].			

We can categorize these statistics in 3 sections: 			

1. Basic
2. Class Statistics
3. Overall Statistics

PyCM is also capable of generating report in HTML, CSV and .pycm formats.

To sum it up, PyCM is the swiss-army knife of confusion matrices, targeted mainly at data scientists that need a broad array of metrics for predictive models and an accurate evaluation of large variety of classifiers [@Haghighi2018].


Website : [http://pycm.shaghighi.ir](http://pycm.shaghighi.ir "http://pycm.shaghighi.ir")

# References