Overview

Dataset statistics

Number of variables: 28
Number of observations: 4
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1000.0 B
Average record size in memory: 250.0 B

Variable types

Categorical: 25
Boolean: 3

Alerts

sexo has constant value "M" (Constant)
TIPOCUP has constant value "Outra" (Constant)
tipoCaso has constant value "Novo" (Constant)
cultEsc has constant value "N/realiz" (Constant)
NECROP has constant value "N/realiz" (Constant)
aids has constant value "False" (Constant)
DIABETES has constant value "False" (Constant)
MENTAL has constant value "False" (Constant)
motMudEsquema has constant value "Nulo" (Constant)
tipoTrat has constant value "Supervisionado" (Constant)
HISTOPATOL has constant value "N/realiz" (Constant)
Status_Resistencia has constant value "1" (Constant)
Cluster has constant value "2" (Constant)
faixaEtaria is highly overall correlated with bac and 4 other fields (High correlation)
ESCOLARID is highly overall correlated with sitAtual and 3 other fields (High correlation)
sitAtual is highly overall correlated with ESCOLARID and 2 other fields (High correlation)
FORMACLIN1 is highly overall correlated with classif and 3 other fields (High correlation)
classif is highly overall correlated with FORMACLIN1 and 3 other fields (High correlation)
descoberta is highly overall correlated with sitAtual and 4 other fields (High correlation)
bac is highly overall correlated with faixaEtaria and 1 other field (High correlation)
BACOUTRO is highly overall correlated with DROGADICAO and 2 other fields (High correlation)
RX is highly overall correlated with FORMACLIN1 and 3 other fields (High correlation)
hiv is highly overall correlated with faixaEtaria and 5 other fields (High correlation)
ALCOOLISMO is highly overall correlated with faixaEtaria and 1 other field (High correlation)
DROGADICAO is highly overall correlated with descoberta and 2 other fields (High correlation)
TABAGISMO is highly overall correlated with ESCOLARID and 2 other fields (High correlation)
idade is highly overall correlated with faixaEtaria and 3 other fields (High correlation)
Probabilidade is highly overall correlated with faixaEtaria and 13 other fields (High correlation)
bac is uniformly distributed (Uniform)
ALCOOLISMO is uniformly distributed (Uniform)
DROGADICAO is uniformly distributed (Uniform)
Probabilidade is uniformly distributed (Uniform)
Probabilidade has unique values (Unique)
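
The Constant and Unique alerts above can be approximated with a simple value count per column. The sketch below is not the profiler's own code: the function name simple_alerts is hypothetical, the sexo and faixaEtaria values are copied from the Sample sections of this report, and the row_id column is invented purely to show an all-distinct field.

```python
from collections import Counter

# sexo and faixaEtaria are copied from the Sample sections of this report.
# row_id is a hypothetical column that stands in for an all-distinct field.
columns = {
    "sexo": ["M", "M", "M", "M"],
    "faixaEtaria": ["60_69", "30_39", "20_29", "30_39"],
    "row_id": ["a", "b", "c", "d"],
}


def simple_alerts(values):
    """Return which of 'Constant' and 'Unique' apply to one column."""
    counts = Counter(values)
    alerts = []
    if len(counts) == 1:
        # Every row holds the same value.
        alerts.append("Constant")
    if len(values) > 1 and len(counts) == len(values):
        # No value is repeated.
        alerts.append("Unique")
    return alerts


alerts_by_column = {name: simple_alerts(vals) for name, vals in columns.items()}
print(alerts_by_column)
```

With only 4 observations, alerts of this kind say little about the underlying population, so they are best read as a description of this sample.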

Reproduction

Analysis started: 2023-10-31 19:44:32.874711
Analysis finished: 2023-10-31 19:44:35.297205
Duration: 2.42 seconds
Software version: pandas-profiling v3.6.6
Download configuration: config.json

Variables

faixaEtaria
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: 30_39, 60_69, 20_29

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 20
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: 60_69
2nd row: 30_39
3rd row: 20_29
4th row: 30_39
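
The report separates Distinct (how many different values occur) from Unique (how many values occur exactly once). A minimal sketch of that reading, using the four faixaEtaria sample rows; the variable names are illustrative and the profiler's internal computation is not shown in this report.

```python
from collections import Counter
from statistics import mean, median

# The four Sample rows of faixaEtaria.
sample = ["60_69", "30_39", "20_29", "30_39"]

counts = Counter(sample)
distinct = len(counts)                              # different values seen
unique = sum(1 for c in counts.values() if c == 1)  # values seen exactly once
lengths = [len(v) for v in sample]

summary = {
    "distinct": distinct,
    "distinct_pct": 100 * distinct / len(sample),
    "unique": unique,
    "unique_pct": 100 * unique / len(sample),
    "max_length": max(lengths),
    "median_length": median(lengths),
    "mean_length": mean(lengths),
    "min_length": min(lengths),
}
print(summary)
```

For this variable the sketch agrees with the figures listed above (3 distinct, 2 unique, every length equal to 5).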

Common Values

Value | Count | Frequency (%)
30_39 | 2 | 50.0%
60_69 | 1 | 25.0%
20_29 | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
30_39 | 2 | 50.0%
60_69 | 1 | 25.0%
20_29 | 1 | 25.0%

Most occurring characters

Value | Count | Frequency (%)
3 | 4 | 20.0%
0 | 4 | 20.0%
_ | 4 | 20.0%
9 | 4 | 20.0%
6 | 2 | 10.0%
2 | 2 | 10.0%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 16 | 80.0%
Connector Punctuation | 4 | 20.0%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
3 | 4 | 25.0%
0 | 4 | 25.0%
9 | 4 | 25.0%
6 | 2 | 12.5%
2 | 2 | 12.5%

Connector Punctuation
Value | Count | Frequency (%)
_ | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 20 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
3 | 4 | 20.0%
0 | 4 | 20.0%
_ | 4 | 20.0%
9 | 4 | 20.0%
6 | 2 | 10.0%
2 | 2 | 10.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 20 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
3 | 4 | 20.0%
0 | 4 | 20.0%
_ | 4 | 20.0%
9 | 4 | 20.0%
6 | 2 | 10.0%
2 | 2 | 10.0%
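
The character and category counts for faixaEtaria can be reproduced with Python's standard unicodedata module. This is a sketch under stated assumptions: the CATEGORY_NAMES mapping is written by hand for the two general-category codes that occur in this variable, and script and block lookups are left out because the standard library does not expose them.

```python
import unicodedata
from collections import Counter

# The four Sample rows of faixaEtaria.
sample = ["60_69", "30_39", "20_29", "30_39"]

# Hand-written long names for the two general-category codes seen here.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Pc": "Connector Punctuation"}

chars = "".join(sample)
char_counts = Counter(chars)
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in chars
)

for name, n in category_counts.most_common():
    print(f"{name} | {n} | {100 * n / len(chars):.1f}%")
```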

sexo
Categorical

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 160.0 B
Categories: M

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: M
2nd row: M
3rd row: M
4th row: M

Common Values

Value | Count | Frequency (%)
M | 4 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
m | 4 | 100.0%

Most occurring characters

Value | Count | Frequency (%)
M | 4 | 100.0%

Most occurring categories

Value | Count | Frequency (%)
Uppercase Letter | 4 | 100.0%

Most frequent character per category

Uppercase Letter
Value | Count | Frequency (%)
M | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 4 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
M | 4 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 4 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
M | 4 | 100.0%

ESCOLARID
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: De 8 a 11 anos, Nenhuma, De 4 a 7 anos

Length

Max length: 14
Median length: 13.5
Mean length: 12
Min length: 7

Characters and Unicode

Total characters: 48
Distinct characters: 15
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Nenhuma
2nd row: De 4 a 7 anos
3rd row: De 8 a 11 anos
4th row: De 8 a 11 anos

Common Values

Value | Count | Frequency (%)
De 8 a 11 anos | 2 | 50.0%
Nenhuma | 1 | 25.0%
De 4 a 7 anos | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
de | 3 | 18.8%
a | 3 | 18.8%
anos | 3 | 18.8%
8 | 2 | 12.5%
11 | 2 | 12.5%
nenhuma | 1 | 6.2%
4 | 1 | 6.2%
7 | 1 | 6.2%
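
The lowercased entries in the table above look like word counts over the category labels. A minimal sketch that reproduces them for ESCOLARID, assuming the tokenisation is simply "lowercase, then split on whitespace" (the report does not state the exact rule):

```python
from collections import Counter

# The four Sample rows of ESCOLARID.
sample = ["Nenhuma", "De 4 a 7 anos", "De 8 a 11 anos", "De 8 a 11 anos"]

# Assumed tokenisation: lowercase each value and split on whitespace.
words = [w for value in sample for w in value.lower().split()]
word_counts = Counter(words)
total = len(words)

for word, n in word_counts.most_common():
    print(f"{word} | {n} | {100 * n / total:.1f}%")
```

Under that assumption the 16 tokens yield the same counts as the table: "de", "a" and "anos" three times each, "8" and "11" twice, and the rest once.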

Most occurring characters

Value | Count | Frequency (%)
(space) | 12 | 25.0%
a | 7 | 14.6%
e | 4 | 8.3%
1 | 4 | 8.3%
n | 4 | 8.3%
D | 3 | 6.2%
o | 3 | 6.2%
s | 3 | 6.2%
8 | 2 | 4.2%
N | 1 | 2.1%
Other values (5) | 5 | 10.4%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 24 | 50.0%
Space Separator | 12 | 25.0%
Decimal Number | 8 | 16.7%
Uppercase Letter | 4 | 8.3%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 7 | 29.2%
e | 4 | 16.7%
n | 4 | 16.7%
o | 3 | 12.5%
s | 3 | 12.5%
h | 1 | 4.2%
u | 1 | 4.2%
m | 1 | 4.2%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 50.0%
8 | 2 | 25.0%
4 | 1 | 12.5%
7 | 1 | 12.5%

Uppercase Letter
Value | Count | Frequency (%)
D | 3 | 75.0%
N | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 12 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 28 | 58.3%
Common | 20 | 41.7%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 7 | 25.0%
e | 4 | 14.3%
n | 4 | 14.3%
D | 3 | 10.7%
o | 3 | 10.7%
s | 3 | 10.7%
N | 1 | 3.6%
h | 1 | 3.6%
u | 1 | 3.6%
m | 1 | 3.6%

Common
Value | Count | Frequency (%)
(space) | 12 | 60.0%
1 | 4 | 20.0%
8 | 2 | 10.0%
4 | 1 | 5.0%
7 | 1 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 48 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 12 | 25.0%
a | 7 | 14.6%
e | 4 | 8.3%
1 | 4 | 8.3%
n | 4 | 8.3%
D | 3 | 6.2%
o | 3 | 6.2%
s | 3 | 6.2%
8 | 2 | 4.2%
N | 1 | 2.1%
Other values (5) | 5 | 10.4%

TIPOCUP
Categorical

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Outra

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 20
Distinct characters: 5
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Outra
2nd row: Outra
3rd row: Outra
4th row: Outra

Common Values

Value | Count | Frequency (%)
Outra | 4 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
outra | 4 | 100.0%

Most occurring characters

Value | Count | Frequency (%)
O | 4 | 20.0%
u | 4 | 20.0%
t | 4 | 20.0%
r | 4 | 20.0%
a | 4 | 20.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 16 | 80.0%
Uppercase Letter | 4 | 20.0%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
u | 4 | 25.0%
t | 4 | 25.0%
r | 4 | 25.0%
a | 4 | 25.0%

Uppercase Letter
Value | Count | Frequency (%)
O | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 20 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
O | 4 | 20.0%
u | 4 | 20.0%
t | 4 | 20.0%
r | 4 | 20.0%
a | 4 | 20.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 20 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
O | 4 | 20.0%
u | 4 | 20.0%
t | 4 | 20.0%
r | 4 | 20.0%
a | 4 | 20.0%

sitAtual
Categorical

Distinct: 2
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Cura, Abandono

Length

Max length: 8
Median length: 4
Mean length: 5
Min length: 4

Characters and Unicode

Total characters: 20
Distinct characters: 9
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 1
Unique (%): 25.0%

Sample

1st row: Cura
2nd row: Abandono
3rd row: Cura
4th row: Cura

Common Values

Value | Count | Frequency (%)
Cura | 3 | 75.0%
Abandono | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
cura | 3 | 75.0%
abandono | 1 | 25.0%

Most occurring characters

Value | Count | Frequency (%)
a | 4 | 20.0%
C | 3 | 15.0%
u | 3 | 15.0%
r | 3 | 15.0%
n | 2 | 10.0%
o | 2 | 10.0%
A | 1 | 5.0%
b | 1 | 5.0%
d | 1 | 5.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 16 | 80.0%
Uppercase Letter | 4 | 20.0%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 4 | 25.0%
u | 3 | 18.8%
r | 3 | 18.8%
n | 2 | 12.5%
o | 2 | 12.5%
b | 1 | 6.2%
d | 1 | 6.2%

Uppercase Letter
Value | Count | Frequency (%)
C | 3 | 75.0%
A | 1 | 25.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 20 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 4 | 20.0%
C | 3 | 15.0%
u | 3 | 15.0%
r | 3 | 15.0%
n | 2 | 10.0%
o | 2 | 10.0%
A | 1 | 5.0%
b | 1 | 5.0%
d | 1 | 5.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 20 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
a | 4 | 20.0%
C | 3 | 15.0%
u | 3 | 15.0%
r | 3 | 15.0%
n | 2 | 10.0%
o | 2 | 10.0%
A | 1 | 5.0%
b | 1 | 5.0%
d | 1 | 5.0%

tipoCaso
Categorical

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Novo

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 16
Distinct characters: 3
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Novo
2nd row: Novo
3rd row: Novo
4th row: Novo

Common Values

Value | Count | Frequency (%)
Novo | 4 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
novo | 4 | 100.0%

Most occurring characters

Value | Count | Frequency (%)
o | 8 | 50.0%
N | 4 | 25.0%
v | 4 | 25.0%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 12 | 75.0%
Uppercase Letter | 4 | 25.0%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
o | 8 | 66.7%
v | 4 | 33.3%

Uppercase Letter
Value | Count | Frequency (%)
N | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 16 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
o | 8 | 50.0%
N | 4 | 25.0%
v | 4 | 25.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 16 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
o | 8 | 50.0%
N | 4 | 25.0%
v | 4 | 25.0%

FORMACLIN1
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Pul, Multiplos Orgaos, Pleural

Length

Max length: 16
Median length: 11.5
Mean length: 7.25
Min length: 3

Characters and Unicode

Total characters: 29
Distinct characters: 15
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Pul
2nd row: Pul
3rd row: Multiplos Orgaos
4th row: Pleural

Common Values

Value | Count | Frequency (%)
Pul | 2 | 50.0%
Multiplos Orgaos | 1 | 25.0%
Pleural | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
pul | 2 | 40.0%
multiplos | 1 | 20.0%
orgaos | 1 | 20.0%
pleural | 1 | 20.0%

Most occurring characters

Value | Count | Frequency (%)
l | 6 | 20.7%
u | 4 | 13.8%
P | 3 | 10.3%
o | 2 | 6.9%
s | 2 | 6.9%
r | 2 | 6.9%
a | 2 | 6.9%
M | 1 | 3.4%
t | 1 | 3.4%
i | 1 | 3.4%
Other values (5) | 5 | 17.2%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 23 | 79.3%
Uppercase Letter | 5 | 17.2%
Space Separator | 1 | 3.4%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
l | 6 | 26.1%
u | 4 | 17.4%
o | 2 | 8.7%
s | 2 | 8.7%
r | 2 | 8.7%
a | 2 | 8.7%
t | 1 | 4.3%
i | 1 | 4.3%
p | 1 | 4.3%
g | 1 | 4.3%

Uppercase Letter
Value | Count | Frequency (%)
P | 3 | 60.0%
M | 1 | 20.0%
O | 1 | 20.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 28 | 96.6%
Common | 1 | 3.4%

Most frequent character per script

Latin
Value | Count | Frequency (%)
l | 6 | 21.4%
u | 4 | 14.3%
P | 3 | 10.7%
o | 2 | 7.1%
s | 2 | 7.1%
r | 2 | 7.1%
a | 2 | 7.1%
M | 1 | 3.6%
t | 1 | 3.6%
i | 1 | 3.6%
Other values (4) | 4 | 14.3%

Common
Value | Count | Frequency (%)
(space) | 1 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 29 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
l | 6 | 20.7%
u | 4 | 13.8%
P | 3 | 10.3%
o | 2 | 6.9%
s | 2 | 6.9%
r | 2 | 6.9%
a | 2 | 6.9%
M | 1 | 3.4%
t | 1 | 3.4%
i | 1 | 3.4%
Other values (5) | 5 | 17.2%

classif
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Pul, Dissem, Ext

Length

Max length: 6
Median length: 3
Mean length: 3.75
Min length: 3

Characters and Unicode

Total characters: 15
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Pul
2nd row: Pul
3rd row: Dissem
4th row: Ext

Common Values

Value | Count | Frequency (%)
Pul | 2 | 50.0%
Dissem | 1 | 25.0%
Ext | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
pul | 2 | 50.0%
dissem | 1 | 25.0%
ext | 1 | 25.0%

Most occurring characters

Value | Count | Frequency (%)
P | 2 | 13.3%
u | 2 | 13.3%
l | 2 | 13.3%
s | 2 | 13.3%
D | 1 | 6.7%
i | 1 | 6.7%
e | 1 | 6.7%
m | 1 | 6.7%
E | 1 | 6.7%
x | 1 | 6.7%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 11 | 73.3%
Uppercase Letter | 4 | 26.7%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
u | 2 | 18.2%
l | 2 | 18.2%
s | 2 | 18.2%
i | 1 | 9.1%
e | 1 | 9.1%
m | 1 | 9.1%
x | 1 | 9.1%
t | 1 | 9.1%

Uppercase Letter
Value | Count | Frequency (%)
P | 2 | 50.0%
D | 1 | 25.0%
E | 1 | 25.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 15 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
P | 2 | 13.3%
u | 2 | 13.3%
l | 2 | 13.3%
s | 2 | 13.3%
D | 1 | 6.7%
i | 1 | 6.7%
e | 1 | 6.7%
m | 1 | 6.7%
E | 1 | 6.7%
x | 1 | 6.7%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 15 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
P | 2 | 13.3%
u | 2 | 13.3%
l | 2 | 13.3%
s | 2 | 13.3%
D | 1 | 6.7%
i | 1 | 6.7%
e | 1 | 6.7%
m | 1 | 6.7%
E | 1 | 6.7%
x | 1 | 6.7%

descoberta
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Elucidacao Diagn. em Internacao, Demanda Ambulatorial, Urgencia / Emergencia

Length

Max length: 31
Median length: 26
Mean length: 25.75
Min length: 20

Characters and Unicode

Total characters: 103
Distinct characters: 22
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Elucidacao Diagn. em Internacao
2nd row: Demanda Ambulatorial
3rd row: Urgencia / Emergencia
4th row: Elucidacao Diagn. em Internacao

Common Values

Value | Count | Frequency (%)
Elucidacao Diagn. em Internacao | 2 | 50.0%
Demanda Ambulatorial | 1 | 25.0%
Urgencia / Emergencia | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
elucidacao | 2 | 15.4%
diagn | 2 | 15.4%
em | 2 | 15.4%
internacao | 2 | 15.4%
demanda | 1 | 7.7%
ambulatorial | 1 | 7.7%
urgencia | 1 | 7.7%
(empty) | 1 | 7.7%
emergencia | 1 | 7.7%

Most occurring characters

Value | Count | Frequency (%)
a | 16 | 15.5%
n | 9 | 8.7%
(space) | 9 | 8.7%
e | 8 | 7.8%
c | 8 | 7.8%
i | 7 | 6.8%
r | 5 | 4.9%
o | 5 | 4.9%
m | 5 | 4.9%
g | 4 | 3.9%
Other values (12) | 27 | 26.2%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 81 | 78.6%
Uppercase Letter | 10 | 9.7%
Space Separator | 9 | 8.7%
Other Punctuation | 3 | 2.9%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
a | 16 | 19.8%
n | 9 | 11.1%
e | 8 | 9.9%
c | 8 | 9.9%
i | 7 | 8.6%
r | 5 | 6.2%
o | 5 | 6.2%
m | 5 | 6.2%
g | 4 | 4.9%
l | 4 | 4.9%
Other values (4) | 10 | 12.3%

Uppercase Letter
Value | Count | Frequency (%)
E | 3 | 30.0%
D | 3 | 30.0%
I | 2 | 20.0%
A | 1 | 10.0%
U | 1 | 10.0%

Other Punctuation
Value | Count | Frequency (%)
. | 2 | 66.7%
/ | 1 | 33.3%

Space Separator
Value | Count | Frequency (%)
(space) | 9 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 91 | 88.3%
Common | 12 | 11.7%

Most frequent character per script

Latin
Value | Count | Frequency (%)
a | 16 | 17.6%
n | 9 | 9.9%
e | 8 | 8.8%
c | 8 | 8.8%
i | 7 | 7.7%
r | 5 | 5.5%
o | 5 | 5.5%
m | 5 | 5.5%
g | 4 | 4.4%
l | 4 | 4.4%
Other values (9) | 20 | 22.0%

Common
Value | Count | Frequency (%)
(space) | 9 | 75.0%
. | 2 | 16.7%
/ | 1 | 8.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 103 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
a | 16 | 15.5%
n | 9 | 8.7%
(space) | 9 | 8.7%
e | 8 | 7.8%
c | 8 | 7.8%
i | 7 | 6.8%
r | 5 | 4.9%
o | 5 | 4.9%
m | 5 | 4.9%
g | 4 | 3.9%
Other values (12) | 27 | 26.2%

bac
Categorical

Alerts: High correlation, Uniform

Distinct: 2
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: N/realiz, Neg

Length

Max length: 8
Median length: 5.5
Mean length: 5.5
Min length: 3

Characters and Unicode

Total characters: 22
Distinct characters: 9
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: N/realiz
2nd row: Neg
3rd row: N/realiz
4th row: Neg

Common Values

Value | Count | Frequency (%)
N/realiz | 2 | 50.0%
Neg | 2 | 50.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
n/realiz | 2 | 50.0%
neg | 2 | 50.0%

Most occurring characters

Value | Count | Frequency (%)
N | 4 | 18.2%
e | 4 | 18.2%
/ | 2 | 9.1%
r | 2 | 9.1%
a | 2 | 9.1%
l | 2 | 9.1%
i | 2 | 9.1%
z | 2 | 9.1%
g | 2 | 9.1%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 16 | 72.7%
Uppercase Letter | 4 | 18.2%
Other Punctuation | 2 | 9.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 4 | 25.0%
r | 2 | 12.5%
a | 2 | 12.5%
l | 2 | 12.5%
i | 2 | 12.5%
z | 2 | 12.5%
g | 2 | 12.5%

Uppercase Letter
Value | Count | Frequency (%)
N | 4 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 20 | 90.9%
Common | 2 | 9.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
N | 4 | 20.0%
e | 4 | 20.0%
r | 2 | 10.0%
a | 2 | 10.0%
l | 2 | 10.0%
i | 2 | 10.0%
z | 2 | 10.0%
g | 2 | 10.0%

Common
Value | Count | Frequency (%)
/ | 2 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 22 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
N | 4 | 18.2%
e | 4 | 18.2%
/ | 2 | 9.1%
r | 2 | 9.1%
a | 2 | 9.1%
l | 2 | 9.1%
i | 2 | 9.1%
z | 2 | 9.1%
g | 2 | 9.1%
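
bac carries a High correlation alert. For two categorical columns, one common association measure is Cramér's V; whether the report used this exact statistic (or a bias-corrected variant) is an assumption here, not something the report states. The sketch below computes the uncorrected form for bac and ALCOOLISMO, assuming the Sample rows of each variable list the four observations in the same order. The function name cramers_v is illustrative.

```python
from collections import Counter
from math import sqrt

# Sample rows as listed in this report, assumed to be in observation order.
bac = ["N/realiz", "Neg", "N/realiz", "Neg"]
alcoolismo = ["N", "S", "N", "S"]


def cramers_v(x, y):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(x)
    joint = Counter(zip(x, y))
    row_totals = Counter(x)
    col_totals = Counter(y)
    chi2 = 0.0
    for r, r_n in row_totals.items():
        for c, c_n in col_totals.items():
            expected = r_n * c_n / n
            chi2 += (joint[(r, c)] - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    # V is undefined when either column is constant.
    return sqrt(chi2 / (n * k)) if k > 0 else float("nan")


v = cramers_v(bac, alcoolismo)
print(v)
```

In these four rows each bac value always pairs with the same ALCOOLISMO value, which is the kind of pattern that triggers a correlation alert. With n = 4 such an association can easily arise by chance.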

BACOUTRO
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: N/realiz, Neg, Pos

Length

Max length: 8
Median length: 5.5
Mean length: 5.5
Min length: 3

Characters and Unicode

Total characters: 22
Distinct characters: 12
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Neg
2nd row: N/realiz
3rd row: N/realiz
4th row: Pos

Common Values

Value | Count | Frequency (%)
N/realiz | 2 | 50.0%
Neg | 1 | 25.0%
Pos | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
n/realiz | 2 | 50.0%
neg | 1 | 25.0%
pos | 1 | 25.0%

Most occurring characters

Value | Count | Frequency (%)
N | 3 | 13.6%
e | 3 | 13.6%
/ | 2 | 9.1%
r | 2 | 9.1%
a | 2 | 9.1%
l | 2 | 9.1%
i | 2 | 9.1%
z | 2 | 9.1%
g | 1 | 4.5%
P | 1 | 4.5%
Other values (2) | 2 | 9.1%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 16 | 72.7%
Uppercase Letter | 4 | 18.2%
Other Punctuation | 2 | 9.1%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 3 | 18.8%
r | 2 | 12.5%
a | 2 | 12.5%
l | 2 | 12.5%
i | 2 | 12.5%
z | 2 | 12.5%
g | 1 | 6.2%
o | 1 | 6.2%
s | 1 | 6.2%

Uppercase Letter
Value | Count | Frequency (%)
N | 3 | 75.0%
P | 1 | 25.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 2 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 20 | 90.9%
Common | 2 | 9.1%

Most frequent character per script

Latin
Value | Count | Frequency (%)
N | 3 | 15.0%
e | 3 | 15.0%
r | 2 | 10.0%
a | 2 | 10.0%
l | 2 | 10.0%
i | 2 | 10.0%
z | 2 | 10.0%
g | 1 | 5.0%
P | 1 | 5.0%
o | 1 | 5.0%

Common
Value | Count | Frequency (%)
/ | 2 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 22 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
N | 3 | 13.6%
e | 3 | 13.6%
/ | 2 | 9.1%
r | 2 | 9.1%
a | 2 | 9.1%
l | 2 | 9.1%
i | 2 | 9.1%
z | 2 | 9.1%
g | 1 | 4.5%
P | 1 | 4.5%
Other values (2) | 2 | 9.1%

cultEsc
Categorical

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: N/realiz

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Characters and Unicode

Total characters: 32
Distinct characters: 8
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: N/realiz
2nd row: N/realiz
3rd row: N/realiz
4th row: N/realiz

Common Values

Value | Count | Frequency (%)
N/realiz | 4 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
n/realiz | 4 | 100.0%

Most occurring characters

Value | Count | Frequency (%)
N | 4 | 12.5%
/ | 4 | 12.5%
r | 4 | 12.5%
e | 4 | 12.5%
a | 4 | 12.5%
l | 4 | 12.5%
i | 4 | 12.5%
z | 4 | 12.5%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 24 | 75.0%
Uppercase Letter | 4 | 12.5%
Other Punctuation | 4 | 12.5%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
r | 4 | 16.7%
e | 4 | 16.7%
a | 4 | 16.7%
l | 4 | 16.7%
i | 4 | 16.7%
z | 4 | 16.7%

Uppercase Letter
Value | Count | Frequency (%)
N | 4 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 28 | 87.5%
Common | 4 | 12.5%

Most frequent character per script

Latin
Value | Count | Frequency (%)
N | 4 | 14.3%
r | 4 | 14.3%
e | 4 | 14.3%
a | 4 | 14.3%
l | 4 | 14.3%
i | 4 | 14.3%
z | 4 | 14.3%

Common
Value | Count | Frequency (%)
/ | 4 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 32 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
N | 4 | 12.5%
/ | 4 | 12.5%
r | 4 | 12.5%
e | 4 | 12.5%
a | 4 | 12.5%
l | 4 | 12.5%
i | 4 | 12.5%
z | 4 | 12.5%

RX
Categorical

Distinct: 3
Distinct (%): 75.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Susp TB, Normal, N/realiz

Length

Max length: 8
Median length: 7.5
Mean length: 7
Min length: 6

Characters and Unicode

Total characters: 28
Distinct characters: 17
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 2
Unique (%): 50.0%

Sample

1st row: Susp TB
2nd row: Susp TB
3rd row: Normal
4th row: N/realiz

Common Values

Value | Count | Frequency (%)
Susp TB | 2 | 50.0%
Normal | 1 | 25.0%
N/realiz | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
susp | 2 | 33.3%
tb | 2 | 33.3%
normal | 1 | 16.7%
n/realiz | 1 | 16.7%

Most occurring characters

Value | Count | Frequency (%)
S | 2 | 7.1%
s | 2 | 7.1%
p | 2 | 7.1%
(space) | 2 | 7.1%
T | 2 | 7.1%
B | 2 | 7.1%
N | 2 | 7.1%
u | 2 | 7.1%
r | 2 | 7.1%
a | 2 | 7.1%
Other values (7) | 8 | 28.6%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 17 | 60.7%
Uppercase Letter | 8 | 28.6%
Space Separator | 2 | 7.1%
Other Punctuation | 1 | 3.6%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
s | 2 | 11.8%
p | 2 | 11.8%
u | 2 | 11.8%
r | 2 | 11.8%
a | 2 | 11.8%
l | 2 | 11.8%
e | 1 | 5.9%
i | 1 | 5.9%
o | 1 | 5.9%
m | 1 | 5.9%

Uppercase Letter
Value | Count | Frequency (%)
S | 2 | 25.0%
T | 2 | 25.0%
B | 2 | 25.0%
N | 2 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 2 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 25 | 89.3%
Common | 3 | 10.7%

Most frequent character per script

Latin
Value | Count | Frequency (%)
S | 2 | 8.0%
s | 2 | 8.0%
p | 2 | 8.0%
T | 2 | 8.0%
B | 2 | 8.0%
N | 2 | 8.0%
u | 2 | 8.0%
r | 2 | 8.0%
a | 2 | 8.0%
l | 2 | 8.0%
Other values (5) | 5 | 20.0%

Common
Value | Count | Frequency (%)
(space) | 2 | 66.7%
/ | 1 | 33.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 28 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
S | 2 | 7.1%
s | 2 | 7.1%
p | 2 | 7.1%
(space) | 2 | 7.1%
T | 2 | 7.1%
B | 2 | 7.1%
N | 2 | 7.1%
u | 2 | 7.1%
r | 2 | 7.1%
a | 2 | 7.1%
Other values (7) | 8 | 28.6%

NECROP
Categorical

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: N/realiz

Length

Max length: 8
Median length: 8
Mean length: 8
Min length: 8

Characters and Unicode

Total characters: 32
Distinct characters: 8
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: N/realiz
2nd row: N/realiz
3rd row: N/realiz
4th row: N/realiz

Common Values

Value | Count | Frequency (%)
N/realiz | 4 | 100.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
n/realiz | 4 | 100.0%

Most occurring characters

Value | Count | Frequency (%)
N | 4 | 12.5%
/ | 4 | 12.5%
r | 4 | 12.5%
e | 4 | 12.5%
a | 4 | 12.5%
l | 4 | 12.5%
i | 4 | 12.5%
z | 4 | 12.5%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 24 | 75.0%
Uppercase Letter | 4 | 12.5%
Other Punctuation | 4 | 12.5%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
r | 4 | 16.7%
e | 4 | 16.7%
a | 4 | 16.7%
l | 4 | 16.7%
i | 4 | 16.7%
z | 4 | 16.7%

Uppercase Letter
Value | Count | Frequency (%)
N | 4 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
/ | 4 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 28 | 87.5%
Common | 4 | 12.5%

Most frequent character per script

Latin
Value | Count | Frequency (%)
N | 4 | 14.3%
r | 4 | 14.3%
e | 4 | 14.3%
a | 4 | 14.3%
l | 4 | 14.3%
i | 4 | 14.3%
z | 4 | 14.3%

Common
Value | Count | Frequency (%)
/ | 4 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 32 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
N | 4 | 12.5%
/ | 4 | 12.5%
r | 4 | 12.5%
e | 4 | 12.5%
a | 4 | 12.5%
l | 4 | 12.5%
i | 4 | 12.5%
z | 4 | 12.5%

hiv
Categorical

Distinct: 2
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: Neg, Pos

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 12
Distinct characters: 6
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 1
Unique (%): 25.0%

Sample

1st row: Neg
2nd row: Neg
3rd row: Pos
4th row: Neg

Common Values

Value | Count | Frequency (%)
Neg | 3 | 75.0%
Pos | 1 | 25.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
neg | 3 | 75.0%
pos | 1 | 25.0%

Most occurring characters

Value | Count | Frequency (%)
N | 3 | 25.0%
e | 3 | 25.0%
g | 3 | 25.0%
P | 1 | 8.3%
o | 1 | 8.3%
s | 1 | 8.3%

Most occurring categories

Value | Count | Frequency (%)
Lowercase Letter | 8 | 66.7%
Uppercase Letter | 4 | 33.3%

Most frequent character per category

Lowercase Letter
Value | Count | Frequency (%)
e | 3 | 37.5%
g | 3 | 37.5%
o | 1 | 12.5%
s | 1 | 12.5%

Uppercase Letter
Value | Count | Frequency (%)
N | 3 | 75.0%
P | 1 | 25.0%

Most occurring scripts

Value | Count | Frequency (%)
Latin | 12 | 100.0%

Most frequent character per script

Latin
Value | Count | Frequency (%)
N | 3 | 25.0%
e | 3 | 25.0%
g | 3 | 25.0%
P | 1 | 8.3%
o | 1 | 8.3%
s | 1 | 8.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 12 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
N | 3 | 25.0%
e | 3 | 25.0%
g | 3 | 25.0%
P | 1 | 8.3%
o | 1 | 8.3%
s | 1 | 8.3%

aids
Boolean

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 36.0 B
Values: False

Value | Count | Frequency (%)
False | 4 | 100.0%

DIABETES
Boolean

Distinct: 1
Distinct (%): 25.0%
Missing: 0
Missing (%): 0.0%
Memory size: 36.0 B
Values: False

Value | Count | Frequency (%)
False | 4 | 100.0%

ALCOOLISMO
Categorical

Alerts: High correlation, Uniform

Distinct: 2
Distinct (%): 50.0%
Missing: 0
Missing (%): 0.0%
Memory size: 64.0 B
Categories: N, S

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 4
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: N
2nd row: S
3rd row: N
4th row: S

Common Values

Value | Count | Frequency (%)
N | 2 | 50.0%
S | 2 | 50.0%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

Value | Count | Frequency (%)
n | 2 | 50.0%
s | 2 | 50.0%

Most occurring characters

ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 4
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

MENTAL
Boolean

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size36.0 B
False
ValueCountFrequency (%)
False 4
100.0%

DROGADICAO
Categorical

HIGH CORRELATION  UNIFORM 

Distinct2
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
N
S

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN
2nd rowS
3rd rowS
4th rowN

Common Values

ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
n 2
50.0%
s 2
50.0%

Most occurring characters

ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 4
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 2
50.0%
S 2
50.0%

TABAGISMO
Categorical

Distinct2
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
N
S

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters4
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)25.0%

Sample

1st rowN
2nd rowS
3rd rowN
4th rowN

Common Values

ValueCountFrequency (%)
N 3
75.0%
S 1
 
25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
n 3
75.0%
s 1
 
25.0%

Most occurring characters

ValueCountFrequency (%)
N 3
75.0%
S 1
 
25.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 4
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
N 3
75.0%
S 1
 
25.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 3
75.0%
S 1
 
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 3
75.0%
S 1
 
25.0%

motMudEsquema
Categorical

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
Nulo

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters16
Distinct characters4
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNulo
2nd rowNulo
3rd rowNulo
4th rowNulo

Common Values

ValueCountFrequency (%)
Nulo 4
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
nulo 4
100.0%

Most occurring characters

ValueCountFrequency (%)
N 4
25.0%
u 4
25.0%
l 4
25.0%
o 4
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 12
75.0%
Uppercase Letter 4
 
25.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 4
33.3%
l 4
33.3%
o 4
33.3%
Uppercase Letter
ValueCountFrequency (%)
N 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 16
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 4
25.0%
u 4
25.0%
l 4
25.0%
o 4
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 16
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 4
25.0%
u 4
25.0%
l 4
25.0%
o 4
25.0%

tipoTrat
Categorical

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
Supervisionado

Length

Max length14
Median length14
Mean length14
Min length14

Characters and Unicode

Total characters56
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSupervisionado
2nd rowSupervisionado
3rd rowSupervisionado
4th rowSupervisionado

Common Values

ValueCountFrequency (%)
Supervisionado 4
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
supervisionado 4
100.0%

Most occurring characters

ValueCountFrequency (%)
i 8
14.3%
o 8
14.3%
S 4
7.1%
u 4
7.1%
p 4
7.1%
e 4
7.1%
r 4
7.1%
v 4
7.1%
s 4
7.1%
n 4
7.1%
Other values (2) 8
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 52
92.9%
Uppercase Letter 4
 
7.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 8
15.4%
o 8
15.4%
u 4
7.7%
p 4
7.7%
e 4
7.7%
r 4
7.7%
v 4
7.7%
s 4
7.7%
n 4
7.7%
a 4
7.7%
Uppercase Letter
ValueCountFrequency (%)
S 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 56
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 8
14.3%
o 8
14.3%
S 4
7.1%
u 4
7.1%
p 4
7.1%
e 4
7.1%
r 4
7.1%
v 4
7.1%
s 4
7.1%
n 4
7.1%
Other values (2) 8
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 8
14.3%
o 8
14.3%
S 4
7.1%
u 4
7.1%
p 4
7.1%
e 4
7.1%
r 4
7.1%
v 4
7.1%
s 4
7.1%
n 4
7.1%
Other values (2) 8
14.3%

idade
Categorical

Distinct2
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
23_39
Mais de 54

Length

Max length10
Median length5
Mean length6.25
Min length5

Characters and Unicode

Total characters25
Distinct characters13
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
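
As a rough illustration of the character-property analysis described above, the sketch below tallies Unicode general categories with Python's standard `unicodedata` module. The input values are the four `idade` sample rows from this report; the helper name `category_counts` is hypothetical and this is not necessarily how the report generator computes its figures.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general-category codes over every character in values."""
    counts = Counter()
    for value in values:
        for ch in value:
            # unicodedata.category returns a two-letter code such as "Nd" or "Ll"
            counts[unicodedata.category(ch)] += 1
    return counts


# The four sample rows reported for the idade variable
idade = ["Mais de 54", "23_39", "23_39", "23_39"]
counts = category_counts(idade)

for code, n in counts.most_common():
    print(code, n)
```

The two-letter codes correspond to the category names used in this report: `Nd` is Decimal Number, `Ll` Lowercase Letter, `Pc` Connector Punctuation, `Zs` Space Separator, and `Lu` Uppercase Letter.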

Unique

Unique1 ?
Unique (%)25.0%

Sample

1st rowMais de 54
2nd row23_39
3rd row23_39
4th row23_39

Common Values

ValueCountFrequency (%)
23_39 3
75.0%
Mais de 54 1
 
25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
23_39 3
50.0%
mais 1
 
16.7%
de 1
 
16.7%
54 1
 
16.7%

Most occurring characters

ValueCountFrequency (%)
3 6
24.0%
2 3
12.0%
_ 3
12.0%
9 3
12.0%
(space) 2
 
8.0%
M 1
 
4.0%
a 1
 
4.0%
i 1
 
4.0%
s 1
 
4.0%
d 1
 
4.0%
Other values (3) 3
12.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 14
56.0%
Lowercase Letter 5
 
20.0%
Connector Punctuation 3
 
12.0%
Space Separator 2
 
8.0%
Uppercase Letter 1
 
4.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 6
42.9%
2 3
21.4%
9 3
21.4%
5 1
 
7.1%
4 1
 
7.1%
Lowercase Letter
ValueCountFrequency (%)
a 1
20.0%
i 1
20.0%
s 1
20.0%
d 1
20.0%
e 1
20.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Space Separator
ValueCountFrequency (%)
(space) 2
100.0%
Uppercase Letter
ValueCountFrequency (%)
M 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 19
76.0%
Latin 6
 
24.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 6
31.6%
2 3
15.8%
_ 3
15.8%
9 3
15.8%
(space) 2
 
10.5%
5 1
 
5.3%
4 1
 
5.3%
Latin
ValueCountFrequency (%)
M 1
16.7%
a 1
16.7%
i 1
16.7%
s 1
16.7%
d 1
16.7%
e 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 25
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 6
24.0%
2 3
12.0%
_ 3
12.0%
9 3
12.0%
(space) 2
 
8.0%
M 1
 
4.0%
a 1
 
4.0%
i 1
 
4.0%
s 1
 
4.0%
d 1
 
4.0%
Other values (3) 3
12.0%

HISTOPATOL
Categorical

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
N/realiz

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters32
Distinct characters8
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowN/realiz
2nd rowN/realiz
3rd rowN/realiz
4th rowN/realiz

Common Values

ValueCountFrequency (%)
N/realiz 4
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
n/realiz 4
100.0%

Most occurring characters

ValueCountFrequency (%)
N 4
12.5%
/ 4
12.5%
r 4
12.5%
e 4
12.5%
a 4
12.5%
l 4
12.5%
i 4
12.5%
z 4
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 24
75.0%
Uppercase Letter 4
 
12.5%
Other Punctuation 4
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 4
16.7%
e 4
16.7%
a 4
16.7%
l 4
16.7%
i 4
16.7%
z 4
16.7%
Uppercase Letter
ValueCountFrequency (%)
N 4
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 28
87.5%
Common 4
 
12.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
N 4
14.3%
r 4
14.3%
e 4
14.3%
a 4
14.3%
l 4
14.3%
i 4
14.3%
z 4
14.3%
Common
ValueCountFrequency (%)
/ 4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 32
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N 4
12.5%
/ 4
12.5%
r 4
12.5%
e 4
12.5%
a 4
12.5%
l 4
12.5%
i 4
12.5%
z 4
12.5%

Status_Resistencia
Categorical

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
1

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters4
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1

Common Values

ValueCountFrequency (%)
1 4
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
1 4
100.0%

Most occurring characters

ValueCountFrequency (%)
1 4
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 4
100.0%

Cluster
Categorical

Distinct1
Distinct (%)25.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
2

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters4
Distinct characters1
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2
2nd row2
3rd row2
4th row2

Common Values

ValueCountFrequency (%)
2 4
100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2 4
100.0%

Most occurring characters

ValueCountFrequency (%)
2 4
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 4
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 4
100.0%

Probabilidade
Categorical

HIGH CORRELATION  UNIFORM  UNIQUE 

Distinct4
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size64.0 B
0.3571453191116193
0.3319027214101606
0.2761522297340586
0.27454173409707605

Length

Max length19
Median length18
Mean length18.25
Min length18

Characters and Unicode

Total characters73
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)100.0%

Sample

1st row0.3571453191116193
2nd row0.3319027214101606
3rd row0.2761522297340586
4th row0.27454173409707605

Common Values

ValueCountFrequency (%)
0.3571453191116193 1
25.0%
0.3319027214101606 1
25.0%
0.2761522297340586 1
25.0%
0.27454173409707605 1
25.0%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.3571453191116193 1
25.0%
0.3319027214101606 1
25.0%
0.2761522297340586 1
25.0%
0.27454173409707605 1
25.0%

Most occurring characters

ValueCountFrequency (%)
1 12
16.4%
0 11
15.1%
7 8
11.0%
3 7
9.6%
2 7
9.6%
5 6
8.2%
4 6
8.2%
6 6
8.2%
9 5
6.8%
. 4
 
5.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 69
94.5%
Other Punctuation 4
 
5.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 12
17.4%
0 11
15.9%
7 8
11.6%
3 7
10.1%
2 7
10.1%
5 6
8.7%
4 6
8.7%
6 6
8.7%
9 5
7.2%
8 1
 
1.4%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 73
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 12
16.4%
0 11
15.1%
7 8
11.0%
3 7
9.6%
2 7
9.6%
5 6
8.2%
4 6
8.2%
6 6
8.2%
9 5
6.8%
. 4
 
5.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 73
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 12
16.4%
0 11
15.1%
7 8
11.0%
3 7
9.6%
2 7
9.6%
5 6
8.2%
4 6
8.2%
6 6
8.2%
9 5
6.8%
. 4
 
5.5%

Correlations

Variable | faixaEtaria | ESCOLARID | sitAtual | FORMACLIN1 | classif | descoberta | bac | BACOUTRO | RX | hiv | ALCOOLISMO | DROGADICAO | TABAGISMO | idade | Probabilidade
faixaEtaria | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.000 | 0.707 | 0.707 | 0.000 | 0.000 | 0.707 | 1.000
ESCOLARID | 0.000 | 1.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.707 | 1.000
sitAtual | 0.000 | 0.707 | 1.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
FORMACLIN1 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
classif | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
descoberta | 0.000 | 0.000 | 0.707 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.707 | 0.707 | 0.000 | 1.000
bac | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
BACOUTRO | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.707 | 1.000
RX | 0.000 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
hiv | 0.707 | 0.000 | 0.000 | 0.707 | 0.707 | 0.707 | 0.000 | 0.000 | 0.707 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000
ALCOOLISMO | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000
DROGADICAO | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 1.000
TABAGISMO | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000
idade | 0.707 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.707 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 1.000
Probabilidade | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | faixaEtaria | sexo | ESCOLARID | TIPOCUP | sitAtual | tipoCaso | FORMACLIN1 | classif | descoberta | bac | BACOUTRO | cultEsc | RX | NECROP | hiv | aids | DIABETES | ALCOOLISMO | MENTAL | DROGADICAO | TABAGISMO | motMudEsquema | tipoTrat | idade | HISTOPATOL | Status_Resistencia | Cluster | Probabilidade
1490 | 60_69 | M | Nenhuma | Outra | Cura | Novo | Pul | Pul | Elucidacao Diagn. em Internacao | N/realiz | Neg | N/realiz | Susp TB | N/realiz | Neg | N | N | N | N | N | N | Nulo | Supervisionado | Mais de 54 | N/realiz | 1 | 2 | 0.357145
794 | 30_39 | M | De 4 a 7 anos | Outra | Abandono | Novo | Pul | Pul | Demanda Ambulatorial | Neg | N/realiz | N/realiz | Susp TB | N/realiz | Neg | N | N | S | N | S | S | Nulo | Supervisionado | 23_39 | N/realiz | 1 | 2 | 0.331903
1416 | 20_29 | M | De 8 a 11 anos | Outra | Cura | Novo | Multiplos Orgaos | Dissem | Urgencia / Emergencia | N/realiz | N/realiz | N/realiz | Normal | N/realiz | Pos | N | N | N | N | S | N | Nulo | Supervisionado | 23_39 | N/realiz | 1 | 2 | 0.276152
1310 | 30_39 | M | De 8 a 11 anos | Outra | Cura | Novo | Pleural | Ext | Elucidacao Diagn. em Internacao | Neg | Pos | N/realiz | N/realiz | N/realiz | Neg | N | N | S | N | N | N | Nulo | Supervisionado | 23_39 | N/realiz | 1 | 2 | 0.274542