Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 4653 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 712 |
Duplicate rows (%) | 15.3% |
Total size in memory | 327.3 KiB |
Average record size in memory | 72.0 B |
Variable types
Categorical | 5 |
---|---|
Numeric | 3 |
Boolean | 1 |
Dataset has 712 (15.3%) duplicate rows | Duplicates |
EverBenched is highly imbalanced (52.2%) | Imbalance |
ExperienceInCurrentDomain has 355 (7.6%) zeros | Zeros |
Reproduction
Analysis started | 2023-10-04 17:49:58.598698 |
---|---|
Analysis finished | 2023-10-04 17:50:00.684643 |
Duration | 2.09 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
Education
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 36.5 KiB |
Bachelors | |
---|---|
Masters | |
PHD | 179 |
Common Values
Value | Count | Frequency (%) |
Bachelors | 3601 | |
Masters | 873 | 18.8% |
PHD | 179 | 3.8% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
bachelors | 3601 | |
masters | 873 | 18.8% |
phd | 179 | 3.8% |
Most occurring characters
Value | Count | Frequency (%) |
s | 5347 | |
a | 4474 | |
e | 4474 | |
r | 4474 | |
B | 3601 | |
c | 3601 | |
h | 3601 | |
l | 3601 | |
o | 3601 | |
M | 873 | 2.2% |
Other values (4) | 1410 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 34046 | |
Uppercase Letter | 5011 | 12.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
s | 5347 | |
a | 4474 | |
e | 4474 | |
r | 4474 | |
c | 3601 | |
h | 3601 | |
l | 3601 | |
o | 3601 | |
t | 873 | 2.6% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 3601 | |
M | 873 | 17.4% |
P | 179 | 3.6% |
H | 179 | 3.6% |
D | 179 | 3.6% |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 39057 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
s | 5347 | |
a | 4474 | |
e | 4474 | |
r | 4474 | |
B | 3601 | |
c | 3601 | |
h | 3601 | |
l | 3601 | |
o | 3601 | |
M | 873 | 2.2% |
Other values (4) | 1410 | 3.6% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 39057 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
s | 5347 | |
a | 4474 | |
e | 4474 | |
r | 4474 | |
B | 3601 | |
c | 3601 | |
h | 3601 | |
l | 3601 | |
o | 3601 | |
M | 873 | 2.2% |
Other values (4) | 1410 | 3.6% |
JoiningYear
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2015.063 |
Minimum | 2012 |
---|---|
Maximum | 2018 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 36.5 KiB |
Quantile statistics
Minimum | 2012 |
---|---|
5-th percentile | 2012 |
Q1 | 2013 |
median | 2015 |
Q3 | 2017 |
95-th percentile | 2018 |
Maximum | 2018 |
Range | 6 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 1.8633768 |
---|---|
Coefficient of variation (CV) | 0.00092472387 |
Kurtosis | -1.2044253 |
Mean | 2015.063 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -0.11346207 |
Sum | 9376088 |
Variance | 3.4721732 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2017 | 1108 | |
2015 | 781 | |
2014 | 699 | |
2013 | 669 | |
2016 | 525 | |
2012 | 504 | |
2018 | 367 | 7.9% |
Value | Count | Frequency (%) |
2012 | 504 | |
2013 | 669 | |
2014 | 699 | |
2015 | 781 | |
2016 | 525 | |
2017 | 1108 | |
2018 | 367 | 7.9% |
Value | Count | Frequency (%) |
2018 | 367 | 7.9% |
2017 | 1108 | |
2016 | 525 | |
2015 | 781 | |
2014 | 699 | |
2013 | 669 | |
2012 | 504 |
City
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 36.5 KiB |
Bangalore | |
---|---|
Pune | |
New Delhi |
Common Values
Value | Count | Frequency (%) |
Bangalore | 2228 | |
Pune | 1268 | |
New Delhi | 1157 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
bangalore | 2228 | |
pune | 1268 | |
new | 1157 | |
delhi | 1157 |
Most occurring characters
Value | Count | Frequency (%) |
e | 5810 | |
a | 4456 | |
n | 3496 | |
l | 3385 | |
B | 2228 | 6.3% |
g | 2228 | 6.3% |
o | 2228 | 6.3% |
r | 2228 | 6.3% |
P | 1268 | 3.6% |
u | 1268 | 3.6% |
Other values (6) | 6942 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 28570 | |
Uppercase Letter | 5810 | 16.3% |
Space Separator | 1157 | 3.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 5810 | |
a | 4456 | |
n | 3496 | |
l | 3385 | |
g | 2228 | 7.8% |
o | 2228 | 7.8% |
r | 2228 | 7.8% |
u | 1268 | 4.4% |
w | 1157 | 4.0% |
h | 1157 | 4.0% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 2228 | |
P | 1268 | |
N | 1157 | |
D | 1157 |
Space Separator
Value | Count | Frequency (%) |
1157 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 34380 | |
Common | 1157 | 3.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 5810 | |
a | 4456 | |
n | 3496 | |
l | 3385 | |
B | 2228 | 6.5% |
g | 2228 | 6.5% |
o | 2228 | 6.5% |
r | 2228 | 6.5% |
P | 1268 | 3.7% |
u | 1268 | 3.7% |
Other values (5) | 5785 |
Common
Value | Count | Frequency (%) |
1157 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 35537 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 5810 | |
a | 4456 | |
n | 3496 | |
l | 3385 | |
B | 2228 | 6.3% |
g | 2228 | 6.3% |
o | 2228 | 6.3% |
r | 2228 | 6.3% |
P | 1268 | 3.6% |
u | 1268 | 3.6% |
Other values (6) | 6942 |
PaymentTier
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 36.5 KiB |
3 | |
---|---|
2 | |
1 | 243 |
Common Values
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Most occurring characters
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4653 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4653 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4653 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3 | 3492 | |
2 | 918 | 19.7% |
1 | 243 | 5.2% |
Age
Real number (ℝ)
Distinct | 20 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.393295 |
Minimum | 22 |
---|---|
Maximum | 41 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 36.5 KiB |
Quantile statistics
Minimum | 22 |
---|---|
5-th percentile | 24 |
Q1 | 26 |
median | 28 |
Q3 | 32 |
95-th percentile | 39 |
Maximum | 41 |
Range | 19 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 4.826087 |
---|---|
Coefficient of variation (CV) | 0.16419007 |
Kurtosis | -0.29982315 |
Mean | 29.393295 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.90519516 |
Sum | 136767 |
Variance | 23.291116 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
26 | 645 | |
28 | 630 | |
27 | 625 | |
25 | 418 | 9.0% |
24 | 385 | 8.3% |
29 | 230 | 4.9% |
30 | 220 | 4.7% |
37 | 141 | 3.0% |
36 | 139 | 3.0% |
34 | 136 | 2.9% |
Other values (10) | 1084 |
Value | Count | Frequency (%) |
22 | 49 | 1.1% |
23 | 48 | 1.0% |
24 | 385 | |
25 | 418 | |
26 | 645 | |
27 | 625 | |
28 | 630 | |
29 | 230 | 4.9% |
30 | 220 | 4.7% |
31 | 125 | 2.7% |
Value | Count | Frequency (%) |
41 | 82 | |
40 | 134 | |
39 | 131 | |
38 | 136 | |
37 | 141 | |
36 | 139 | |
35 | 123 | |
34 | 136 | |
33 | 124 | |
32 | 132 |
Gender
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 36.5 KiB |
Male | |
---|---|
Female |
Common Values
Value | Count | Frequency (%) |
Male | 2778 | |
Female | 1875 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
male | 2778 | |
female | 1875 |
Most occurring characters
Value | Count | Frequency (%) |
e | 6528 | |
a | 4653 | |
l | 4653 | |
M | 2778 | |
F | 1875 | 8.4% |
m | 1875 | 8.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 17709 | |
Uppercase Letter | 4653 | 20.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 6528 | |
a | 4653 | |
l | 4653 | |
m | 1875 | 10.6% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 2778 | |
F | 1875 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 22362 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 6528 | |
a | 4653 | |
l | 4653 | |
M | 2778 | |
F | 1875 | 8.4% |
m | 1875 | 8.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 22362 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 6528 | |
a | 4653 | |
l | 4653 | |
M | 2778 | |
F | 1875 | 8.4% |
m | 1875 | 8.4% |
EverBenched
Boolean
IMBALANCE
 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.7 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 4175 | |
True | 478 | 10.3% |
ExperienceInCurrentDomain
Real number (ℝ)
ZEROS
 
Distinct | 8 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.9056523 |
Minimum | 0 |
---|---|
Maximum | 7 |
Zeros | 355 |
Zeros (%) | 7.6% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 36.5 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 2 |
median | 3 |
Q3 | 4 |
95-th percentile | 5 |
Maximum | 7 |
Range | 7 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.5582403 |
---|---|
Coefficient of variation (CV) | 0.53627901 |
Kurtosis | -0.96941346 |
Mean | 2.9056523 |
Median Absolute Deviation (MAD) | 1 |
Skewness | -0.16255594 |
Sum | 13520 |
Variance | 2.4281129 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 1087 | |
4 | 931 | |
5 | 919 | |
3 | 786 | |
1 | 558 | |
0 | 355 | 7.6% |
7 | 9 | 0.2% |
6 | 8 | 0.2% |
Value | Count | Frequency (%) |
0 | 355 | 7.6% |
1 | 558 | |
2 | 1087 | |
3 | 786 | |
4 | 931 | |
5 | 919 | |
6 | 8 | 0.2% |
7 | 9 | 0.2% |
Value | Count | Frequency (%) |
7 | 9 | 0.2% |
6 | 8 | 0.2% |
5 | 919 | |
4 | 931 | |
3 | 786 | |
2 | 1087 | |
1 | 558 | |
0 | 355 | 7.6% |
LeaveOrNot
Categorical
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 36.5 KiB |
0 | |
---|---|
1 |
Common Values
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 4653 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 4653 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4653 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3053 | |
1 | 1600 |
JoiningYear | Age | ExperienceInCurrentDomain | Education | City | PaymentTier | Gender | EverBenched | LeaveOrNot | |
---|---|---|---|---|---|---|---|---|---|
JoiningYear | 1.000 | 0.008 | -0.038 | 0.214 | 0.201 | 0.267 | 0.150 | 0.131 | 0.417 |
Age | 0.008 | 1.000 | -0.142 | 0.018 | 0.027 | 0.000 | 0.000 | 0.023 | 0.066 |
ExperienceInCurrentDomain | -0.038 | -0.142 | 1.000 | 0.118 | 0.052 | 0.026 | 0.000 | 0.000 | 0.039 |
Education | 0.214 | 0.018 | 0.118 | 1.000 | 0.316 | 0.183 | 0.008 | 0.056 | 0.146 |
City | 0.201 | 0.027 | 0.052 | 0.316 | 1.000 | 0.295 | 0.214 | 0.021 | 0.209 |
PaymentTier | 0.267 | 0.000 | 0.026 | 0.183 | 0.295 | 1.000 | 0.275 | 0.009 | 0.269 |
Gender | 0.150 | 0.000 | 0.000 | 0.008 | 0.214 | 0.275 | 1.000 | 0.012 | 0.220 |
EverBenched | 0.131 | 0.023 | 0.000 | 0.056 | 0.021 | 0.009 | 0.012 | 1.000 | 0.076 |
LeaveOrNot | 0.417 | 0.066 | 0.039 | 0.146 | 0.209 | 0.269 | 0.220 | 0.076 | 1.000 |
Education | JoiningYear | City | PaymentTier | Age | Gender | EverBenched | ExperienceInCurrentDomain | LeaveOrNot | |
---|---|---|---|---|---|---|---|---|---|
0 | Bachelors | 2017 | Bangalore | 3 | 34 | Male | No | 0 | 0 |
1 | Bachelors | 2013 | Pune | 1 | 28 | Female | No | 3 | 1 |
2 | Bachelors | 2014 | New Delhi | 3 | 38 | Female | No | 2 | 0 |
3 | Masters | 2016 | Bangalore | 3 | 27 | Male | No | 5 | 1 |
4 | Masters | 2017 | Pune | 3 | 24 | Male | Yes | 2 | 1 |
5 | Bachelors | 2016 | Bangalore | 3 | 22 | Male | No | 0 | 0 |
6 | Bachelors | 2015 | New Delhi | 3 | 38 | Male | No | 0 | 0 |
7 | Bachelors | 2016 | Bangalore | 3 | 34 | Female | No | 2 | 1 |
8 | Bachelors | 2016 | Pune | 3 | 23 | Male | No | 1 | 0 |
9 | Masters | 2017 | New Delhi | 2 | 37 | Male | No | 2 | 0 |
Education | JoiningYear | City | PaymentTier | Age | Gender | EverBenched | ExperienceInCurrentDomain | LeaveOrNot | |
---|---|---|---|---|---|---|---|---|---|
4643 | Bachelors | 2013 | Bangalore | 3 | 31 | Female | No | 5 | 0 |
4644 | Bachelors | 2015 | Pune | 3 | 32 | Female | Yes | 1 | 1 |
4645 | Masters | 2017 | Pune | 2 | 31 | Female | No | 2 | 0 |
4646 | Bachelors | 2013 | Bangalore | 3 | 25 | Female | No | 3 | 0 |
4647 | Bachelors | 2016 | Pune | 3 | 30 | Male | No | 2 | 0 |
4648 | Bachelors | 2013 | Bangalore | 3 | 26 | Female | No | 4 | 0 |
4649 | Masters | 2013 | Pune | 2 | 37 | Male | No | 2 | 1 |
4650 | Masters | 2018 | New Delhi | 3 | 27 | Male | No | 5 | 1 |
4651 | Bachelors | 2012 | Bangalore | 3 | 30 | Male | Yes | 2 | 0 |
4652 | Bachelors | 2015 | Bangalore | 3 | 33 | Male | Yes | 4 | 0 |
Most frequently occurring
Education | JoiningYear | City | PaymentTier | Age | Gender | EverBenched | ExperienceInCurrentDomain | LeaveOrNot | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|
78 | Bachelors | 2013 | Bangalore | 3 | 26 | Male | No | 4 | 0 | 32 |
367 | Bachelors | 2016 | Bangalore | 3 | 26 | Male | No | 4 | 0 | 28 |
11 | Bachelors | 2012 | Bangalore | 3 | 26 | Male | No | 4 | 0 | 26 |
153 | Bachelors | 2014 | Bangalore | 3 | 25 | Male | No | 3 | 0 | 24 |
157 | Bachelors | 2014 | Bangalore | 3 | 26 | Male | No | 4 | 0 | 24 |
161 | Bachelors | 2014 | Bangalore | 3 | 27 | Male | No | 5 | 0 | 24 |
440 | Bachelors | 2017 | Bangalore | 3 | 27 | Male | No | 5 | 0 | 24 |
81 | Bachelors | 2013 | Bangalore | 3 | 27 | Male | No | 5 | 0 | 21 |
150 | Bachelors | 2014 | Bangalore | 3 | 24 | Male | No | 2 | 0 | 20 |
261 | Bachelors | 2015 | Bangalore | 3 | 26 | Male | No | 4 | 0 | 20 |