发布于2019-08-07 13:33 阅读(1275) 评论(0) 点赞(4) 收藏(4)
df_train = pd.read_csv('happiness_train_complete.csv', encoding='gbk', parse_dates=['survey_time'])
df_test = pd.read_csv('happiness_test_complete.csv', encoding='gbk', parse_dates=['survey_time'])
df_train['happiness'] = df_train['happiness'].replace(-8, 3)
def fill_data(df):
df['hukou_loc'] = df['hukou_loc'].fillna(4)
df['family_income'] = df['family_income'].fillna(df['family_income'].mean())
df.fillna(0, inplace=True) # 其余使用0填充
def generator_attribute(df):
df['survey_time'] = df['survey_time'].dt.year
df['age'] = df['survey_time'] - df['birth']
generator_attribute(df_train)
generator_attribute(df_test)
plt.figure(figsize=(16, 8))
plt.subplot(1, 2, 1)
df_train['happiness'].value_counts().plot(kind='pie', autopct='%1.1f%%')
plt.subplot(1, 2, 2)
df_train['happiness'].value_counts().plot(kind='bar')
plt.savefig('happiness_distribution.png')
plt.figure(figsize=(16, 8))
sns.countplot(data=df_train, x='gender', hue='happiness')
plt.title('different happiness level by gender')
plt.savefig('gender_bar.png')
plt.figure(figsize=(16, 8))
plt.subplot(1, 2, 1)
df_train['happiness'][df_train['gender'] == 1].value_counts().plot(kind='pie', autopct='%1.1f%%')
plt.subplot(1, 2, 2)
df_train['happiness'][df_train['gender'] == 2].value_counts().plot(kind='pie', autopct='%1.1f%%')
plt.title('different happiness level percent by gender')
plt.savefig('gender_pie.png')
作者:085iitirtu
链接:https://www.pythonheidong.com/blog/article/11237/3ddb32e9c68466cbd07b/
来源:python黑洞网
任何形式的转载都请注明出处,如有侵权 一经发现 必将追究其法律责任
昵称:
评论内容:(最多支持255个字符)
---无人问津也好,技不如人也罢,你都要试着安静下来,去做自己该做的事,而不是让内心的烦躁、焦虑,坏掉你本来就不多的热情和定力
Copyright © 2018-2021 python黑洞网 All Rights Reserved 版权所有,并保留所有权利。 京ICP备18063182号-1
投诉与举报,广告合作请联系vgs_info@163.com或QQ3083709327
免责声明:网站文章均由用户上传,仅供读者学习交流使用,禁止用做商业用途。若文章涉及色情,反动,侵权等违法信息,请向我们举报,一经核实我们会立即删除!