Overview

Dataset statistics

Number of variables26
Number of observations1536
Missing cells1516
Missing cells (%)3.8%
Total size in memory312.1 KiB
Average record size in memory208.1 B

Variable types

Categorical25
Numeric1

Alerts

tipoCaso is highly imbalanced (86.1%)Imbalance
FORMACLIN1 is highly imbalanced (81.2%)Imbalance
classif is highly imbalanced (58.8%)Imbalance
BACOUTRO is highly imbalanced (69.2%)Imbalance
NECROP is highly imbalanced (97.4%)Imbalance
hiv is highly imbalanced (51.9%)Imbalance
DIABETES is highly imbalanced (66.0%)Imbalance
MENTAL is highly imbalanced (88.0%)Imbalance
HISTOPATOL is highly imbalanced (75.4%)Imbalance
motMudEsquema has 1516 (98.7%) missing valuesMissing
Status_Resistencia has 768 (50.0%) zerosZeros

Reproduction

Analysis started2023-10-31 18:15:45.547735
Analysis finished2023-10-31 18:15:45.745820
Duration0.2 seconds
Software versionpandas-profiling v3.6.6
Download configurationconfig.json

Variables

faixaEtaria
Categorical

Distinct12
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
20_29
387 
30_39
372 
40_49
323 
50_59
191 
60_69
90 
Other values (7)
173 

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st row20_29
2nd row40_49
3rd row40_49
4th row50_59
5th row30_39

Common Values

ValueCountFrequency (%)
20_29 387
25.2%
30_39 372
24.2%
40_49 323
21.0%
50_59 191
12.4%
60_69 90
 
5.9%
15_19 87
 
5.7%
70_79 35
 
2.3%
10_14 22
 
1.4%
05_09 11
 
0.7%
Maior de 80 anos 9
 
0.6%
Other values (2) 9
 
0.6%

sexo
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
M
1065 
F
471 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowM
2nd rowM
3rd rowM
4th rowM
5th rowM

Common Values

ValueCountFrequency (%)
M 1065
69.3%
F 471
30.7%

Common Values (Plot)

2023-10-31T15:15:45.925329image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

ESCOLARID
Categorical

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
De 8 a 11 anos
595 
De 4 a 7 anos
524 
De 1 a 3 anos
185 
De 12 a 14 anos
119 
Nenhuma
60 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDe 4 a 7 anos
2nd rowDe 4 a 7 anos
3rd rowDe 8 a 11 anos
4th rowDe 4 a 7 anos
5th rowDe 4 a 7 anos

Common Values

ValueCountFrequency (%)
De 8 a 11 anos 595
38.7%
De 4 a 7 anos 524
34.1%
De 1 a 3 anos 185
 
12.0%
De 12 a 14 anos 119
 
7.7%
Nenhuma 60
 
3.9%
15 anos e mais 53
 
3.5%

Common Values (Plot)

2023-10-31T15:15:46.079583image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

TIPOCUP
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Outra
1007 
Desempregado
307 
Dona de Casa
108 
Aposentado
 
93
Profissional de Saude
 
21

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutra
2nd rowDesempregado
3rd rowOutra
4th rowOutra
5th rowOutra

Common Values

ValueCountFrequency (%)
Outra 1007
65.6%
Desempregado 307
 
20.0%
Dona de Casa 108
 
7.0%
Aposentado 93
 
6.1%
Profissional de Saude 21
 
1.4%

Common Values (Plot)

2023-10-31T15:15:46.253719image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

sitAtual
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Cura
1308 
Abandono
228 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCura
2nd rowCura
3rd rowCura
4th rowCura
5th rowCura

Common Values

ValueCountFrequency (%)
Cura 1308
85.2%
Abandono 228
 
14.8%

Common Values (Plot)

2023-10-31T15:15:46.407372image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

tipoCaso
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Novo
1477 
Recidiva
 
32
Retr Aband
 
26
Retrat apos falencia/resistencia
 
1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowNovo
2nd rowNovo
3rd rowNovo
4th rowNovo
5th rowNovo

Common Values

ValueCountFrequency (%)
Novo 1477
96.2%
Recidiva 32
 
2.1%
Retr Aband 26
 
1.7%
Retrat apos falencia/resistencia 1
 
0.1%

Common Values (Plot)

2023-10-31T15:15:46.551510image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

FORMACLIN1
Categorical

Distinct12
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Pul
1387 
Pleural
 
85
Ganglionar Periferica
 
24
Meningea
 
8
Oftalmica
 
7
Other values (7)
 
25

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowPul
2nd rowPul
3rd rowPul
4th rowPul
5th rowPul

Common Values

ValueCountFrequency (%)
Pul 1387
90.3%
Pleural 85
 
5.5%
Ganglionar Periferica 24
 
1.6%
Meningea 8
 
0.5%
Oftalmica 7
 
0.5%
Outras 6
 
0.4%
Pele 5
 
0.3%
Miliar 5
 
0.3%
Multiplos Orgaos 3
 
0.2%
Ossea 3
 
0.2%
Other values (2) 3
 
0.2%

classif
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Pul
1281 
Ext
146 
P+E
 
106
Dissem
 
3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPul
2nd rowP+E
3rd rowP+E
4th rowP+E
5th rowP+E

Common Values

ValueCountFrequency (%)
Pul 1281
83.4%
Ext 146
 
9.5%
P+E 106
 
6.9%
Dissem 3
 
0.2%

Common Values (Plot)

2023-10-31T15:15:46.702881image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

descoberta
Categorical

Distinct6
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Demanda Ambulatorial
701 
Urgencia / Emergencia
376 
Elucidacao Diagn. em Internacao
344 
Busca Ativa na Comunidade
 
46
Investigacao de Contatos
 
41

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowElucidacao Diagn. em Internacao
2nd rowDemanda Ambulatorial
3rd rowElucidacao Diagn. em Internacao
4th rowDemanda Ambulatorial
5th rowUrgencia / Emergencia

Common Values

ValueCountFrequency (%)
Demanda Ambulatorial 701
45.6%
Urgencia / Emergencia 376
24.5%
Elucidacao Diagn. em Internacao 344
22.4%
Busca Ativa na Comunidade 46
 
3.0%
Investigacao de Contatos 41
 
2.7%
Busca Ativa em Instituicao 28
 
1.8%

Common Values (Plot)

2023-10-31T15:15:46.874431image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

bac
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Pos
925 
Neg
352 
N/realiz
259 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPos
2nd rowPos
3rd rowN/realiz
4th rowPos
5th rowPos

Common Values

ValueCountFrequency (%)
Pos 925
60.2%
Neg 352
 
22.9%
N/realiz 259
 
16.9%

Common Values (Plot)

2023-10-31T15:15:47.103148image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

BACOUTRO
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N/realiz
1365 
Neg
 
103
Pos
 
67
And
 
1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowN/realiz
2nd rowN/realiz
3rd rowPos
4th rowN/realiz
5th rowN/realiz

Common Values

ValueCountFrequency (%)
N/realiz 1365
88.9%
Neg 103
 
6.7%
Pos 67
 
4.4%
And 1
 
0.1%

Common Values (Plot)

2023-10-31T15:15:47.244841image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

cultEsc
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Pos
858 
N/realiz
529 
Neg
146 
And
 
3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPos
2nd rowPos
3rd rowN/realiz
4th rowPos
5th rowN/realiz

Common Values

ValueCountFrequency (%)
Pos 858
55.9%
N/realiz 529
34.4%
Neg 146
 
9.5%
And 3
 
0.2%

Common Values (Plot)

2023-10-31T15:15:47.388900image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

RX
Categorical

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Susp TB
941 
Susp c/cavid
291 
N/realiz
208 
Normal
 
79
Outra Patologia
 
17

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSusp c/cavid
2nd rowSusp TB
3rd rowSusp TB
4th rowSusp TB
5th rowSusp TB

Common Values

ValueCountFrequency (%)
Susp TB 941
61.3%
Susp c/cavid 291
 
18.9%
N/realiz 208
 
13.5%
Normal 79
 
5.1%
Outra Patologia 17
 
1.1%

Common Values (Plot)

2023-10-31T15:15:47.543314image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

NECROP
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N/realiz
1530 
Sugestivo TB
 
4
BAAR pos
 
2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN/realiz
2nd rowN/realiz
3rd rowN/realiz
4th rowN/realiz
5th rowN/realiz

Common Values

ValueCountFrequency (%)
N/realiz 1530
99.6%
Sugestivo TB 4
 
0.3%
BAAR pos 2
 
0.1%

Common Values (Plot)

2023-10-31T15:15:47.697709image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

hiv
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Neg
1200 
Pos
226 
N/realiz
 
109
And
 
1

Unique

Unique1 ?
Unique (%)0.1%

Sample

1st rowNeg
2nd rowNeg
3rd rowPos
4th rowPos
5th rowPos

Common Values

ValueCountFrequency (%)
Neg 1200
78.1%
Pos 226
 
14.7%
N/realiz 109
 
7.1%
And 1
 
0.1%

Common Values (Plot)

2023-10-31T15:15:47.836173image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

aids
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1329 
S
207 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowS
4th rowS
5th rowS

Common Values

ValueCountFrequency (%)
N 1329
86.5%
S 207
 
13.5%

Common Values (Plot)

2023-10-31T15:15:47.971738image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

DIABETES
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1439 
S
 
97

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 1439
93.7%
S 97
 
6.3%

Common Values (Plot)

2023-10-31T15:15:48.096592image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

ALCOOLISMO
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1198 
S
338 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowS
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 1198
78.0%
S 338
 
22.0%

Common Values (Plot)

2023-10-31T15:15:48.220095image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

MENTAL
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1511 
S
 
25

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 1511
98.4%
S 25
 
1.6%

Common Values (Plot)

2023-10-31T15:15:48.343957image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

DROGADICAO
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1241 
S
295 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowN
3rd rowN
4th rowN
5th rowS

Common Values

ValueCountFrequency (%)
N 1241
80.8%
S 295
 
19.2%

Common Values (Plot)

2023-10-31T15:15:48.606416image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

TABAGISMO
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N
1231 
S
305 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowS
3rd rowN
4th rowN
5th rowN

Common Values

ValueCountFrequency (%)
N 1231
80.1%
S 305
 
19.9%

Common Values (Plot)

2023-10-31T15:15:48.732538image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

motMudEsquema
Categorical

Distinct3
Distinct (%)15.0%
Missing1516
Missing (%)98.7%
Memory size12.1 KiB
Intolerancia/Toxicidade
13 
Resistencia Medicamentosa
Outro Motivo

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowResistencia Medicamentosa
2nd rowResistencia Medicamentosa
3rd rowResistencia Medicamentosa
4th rowResistencia Medicamentosa
5th rowIntolerancia/Toxicidade

Common Values

ValueCountFrequency (%)
Intolerancia/Toxicidade 13
 
0.8%
Resistencia Medicamentosa 5
 
0.3%
Outro Motivo 2
 
0.1%
(Missing) 1516
98.7%

Common Values (Plot)

2023-10-31T15:15:48.869056image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

tipoTrat
Categorical

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Supervisionado
1130 
Auto-Administrado
406 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSupervisionado
2nd rowSupervisionado
3rd rowAuto-Administrado
4th rowSupervisionado
5th rowSupervisionado

Common Values

ValueCountFrequency (%)
Supervisionado 1130
73.6%
Auto-Administrado 406
 
26.4%

Common Values (Plot)

2023-10-31T15:15:49.014518image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

idade
Categorical

Distinct4
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
40_54
436 
23_39
420 
0_22
355 
Mais de 54
325 

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row23_39
2nd row40_54
3rd row40_54
4th rowMais de 54
5th row23_39

Common Values

ValueCountFrequency (%)
40_54 436
28.4%
23_39 420
27.3%
0_22 355
23.1%
Mais de 54 325
21.2%

Common Values (Plot)

2023-10-31T15:15:49.161096image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

HISTOPATOL
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
N/realiz
1441 
Sugestivo TB
 
65
BAAR pos
 
30

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN/realiz
2nd rowN/realiz
3rd rowBAAR pos
4th rowN/realiz
5th rowSugestivo TB

Common Values

ValueCountFrequency (%)
N/realiz 1441
93.8%
Sugestivo TB 65
 
4.2%
BAAR pos 30
 
2.0%

Common Values (Plot)

2023-10-31T15:15:49.305419image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

Status_Resistencia
Real number (ℝ)

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5
Minimum0
Maximum1
Zeros768
Zeros (%)50.0%
Negative0
Negative (%)0.0%
Memory size12.1 KiB
2023-10-31T15:15:49.414781image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.5
Q31
95-th percentile1
Maximum1
Range1
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5001628399
Coefficient of variation (CV)1.00032568
Kurtosis-2.002609263
Mean0.5
Median Absolute Deviation (MAD)0.5
Skewness0
Sum768
Variance0.2501628664
MonotonicityNot monotonic
2023-10-31T15:15:49.523416image/svg+xmlMatplotlib v3.6.0, https://matplotlib.org/
Histogram with fixed size bins (bins=2)
ValueCountFrequency (%)
1 768
50.0%
0 768
50.0%
ValueCountFrequency (%)
0 768
50.0%
1 768
50.0%
ValueCountFrequency (%)
1 768
50.0%
0 768
50.0%