Oct-13-2019, 01:38 PM
I have a dataset with Age column which has data as follows:
I want to create groups and then visualize histogram groups something like below:
Please advise
df_s7['Age'].unique()array(['28 Years', '10 Month(s) 15 Day(s)', '46 Years', '65 Years', '45 Years', '30 Years', '47 Years', '17 Years', '55 Years', '50 Years', '39 Years', '42 Years', '38 Years', '40 Years', '20 Years', ' < 1 Year', '29 Years', '43 Years', '31 Years', '36 Years', '11 Years', '48 Years', '23 Years', '25 Years', '32 Years', '82 Years', '44 Years', '37 Years', '52 Years', '35 Years', '18 Years', '19 Years', '49 Years', '62 Years', '51 Years', '72 Years', '26 Years', '54 Years', '24 Years', '59 Years', '34 Years', '53 Years', '14 Years', '71 Years', '27 Years', '66 Years', '33 Years', '22 Years', '70 Years', '60 Years', '21 Years', '3 Month(s) 11 Day(s)', '58 Years', '56 Years', '63 Years', '5 Years', '64 Years', '10 Years', '16 Years', '15 Years', '75 Years', '57 Years', '2 Years', '83 Years', '77 Years', '74 Years', '13 Years', '41 Years', '69 Years', '1 Month(s) 29 Day(s)', '8 Years', '7 Month(s) 16 Day(s)', '61 Years', '67 Years', '1 Month(s) 30 Day(s)', '84 Years', '1 Month(s) 12 Day(s)', '6 Month(s) 26 Day(s)', '12 Years', '5 Month(s) 18 Day(s)', '68 Years', '80 Years', '3 Month(s) 19 Day(s)', '76 Years', '86 Years', '7 Month(s) 2 Day(s)', '1 Years', '73 Years', '90 Years', '6 Month(s) 20 Day(s)', '79 Years', '89 Years', '9 Years', '3 Month(s) 29 Day(s)', '8 Month(s) 21 Day(s)', '4 Years', '6 Month(s) 8 Day(s)', '78 Years', '6 Years', '87 Years', '7 Years', '6 Month(s) 9 Day(s)', '4 Month(s) 20 Day(s)', '10 Month(s) 16 Day(s)', '4 Month(s) 11 Day(s)', '6 Month(s) 18 Day(s)', '4 Month(s) 13 Day(s)'], dtype=object)
I want to create groups and then visualize histogram groups something like below:
def age_buckets(x): if x < 1: return '0-1' if x < 17: return '1-17' if x < 30: return '18-29' elif x < 40: return '30-39' elif x < 50: return '40-49' elif x < 60: return '50-59' elif x < 70: return '60-69' elif x >=70: return '70+' else: return 'other'There is also a "Sex" column- Male, Female, Transgender 1) I want to plot(1D) histogram only on Age col based different age groups and color code 2) plot based on Age & Sex column for different age groups and color code
Please advise