I have a dataframe like the one below (my input data of COVID cases).
data:
date state cases
0 20200625 NY 300
1 20200625 CA 250
2 20200625 TX 200
3 20200625 FL 100
5 20200624 NY 290
6 20200624 CA 240
7 20200624 TX 100
8 20200624 FL 80
...
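Note that the "date" column in the data above is a number (not a datetime).
I would like to turn it into a time series like this (the desired output), with the dates as the index and each state's COVID cases as the columns: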
NY CA TX FL
20200625 300 250 200 100
20200626 290 240 100 80
...
So far I have only created the empty frame of the output, using the following code:
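import pandas as pd

states = ['NY', 'CA', 'TX', 'FL']
days = [20200625, 20200626]
columns = states
positives = pd.DataFrame(columns=columns)
# Add one empty row per day, storing the day in a temporary "date" column
i = 0
for day in days:
    positives.loc[i, "date"] = day
    i = i + 1
# Move "date" into the index and drop the index name
positives.set_index('date', inplace=True)
positives = positives.rename_axis(None)
print(positives)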
It returns:
NY CA TX FL
20200625.0 NaN NaN NaN NaN
20200626.0 NaN NaN NaN NaN
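How can I fill it with the values of the "cases" column from the "data" dataframe where:
(i) the value of "state" in data equals the column header in positives,
(ii) the value of "date" in data equals the row index in positives?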
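
For reference, one possible way to get this shape without filling an empty frame cell by cell is a pivot. The sketch below is only an illustration: it rebuilds the eight sample rows shown above as `data`, and it assumes each (date, state) pair appears at most once. `DataFrame.pivot` raises a `ValueError` on duplicate pairs, in which case `pivot_table` with an aggregation function may be a better fit.

```python
import pandas as pd

# Sample rows copied from the question; the real `data` would be loaded elsewhere.
data = pd.DataFrame({
    "date": [20200625] * 4 + [20200624] * 4,
    "state": ["NY", "CA", "TX", "FL"] * 2,
    "cases": [300, 250, 200, 100, 290, 240, 100, 80],
})

states = ["NY", "CA", "TX", "FL"]

# Reshape from long to wide: one row per date, one column per state,
# with each cell taken from the "cases" column.
positives = (
    data.pivot(index="date", columns="state", values="cases")
        .reindex(columns=states)       # keep the column order from `states`
        .sort_index(ascending=False)   # newest date first
)

# Drop the axis names ("date" / "state") so the frame prints without them.
positives.index.name = None
positives.columns.name = None

print(positives)
```

Because "date" stays numeric here, the index holds the integers 20200625 and 20200624. If a real datetime index is wanted, `pd.to_datetime(data["date"].astype(str), format="%Y%m%d")` could be applied to the column before pivoting.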