Pandas: Ignore Specific Columns When Importing an Excel File

By Benjamin Anderson · July 13, 2023 · Guide

When importing an Excel file into a pandas DataFrame, you can use the following basic syntax to ignore specific columns:

import pandas as pd

#define columns to skip
skip_cols = [1, 2]

#define columns to keep
keep_cols = [i for i in range(4) if i not in skip_cols]

#import Excel file and skip specific columns
df = pd.read_excel('my_data.xlsx', usecols=keep_cols)

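This particular example ignores the columns at index positions 1 and 2 when importing the Excel file named my_data.xlsx into pandas. Index positions are zero-based, so these are the second and third columns of the sheet.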
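To see exactly what is passed to usecols, the list comprehension can be evaluated on its own. This is a minimal sketch; the name total_cols is introduced here only for illustration and stands for the 4 used in range(4).

```python
# Positions to skip (zero-based), as in the syntax above
skip_cols = [1, 2]

# Assumed total number of columns in the sheet (illustrative name)
total_cols = 4

# Keep every position from 0 to total_cols - 1 that is not skipped
keep_cols = [i for i in range(total_cols) if i not in skip_cols]

print(keep_cols)  # [0, 3]
```

Only positions 0 and 3 remain, so pandas reads the first and fourth columns.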

The following example shows how to use this syntax in practice.

Example: Ignore Specific Columns When Importing an Excel File into Pandas

Suppose we have an Excel file named player_data.xlsx with four columns: team, points, rebounds, and assists.

We can use the following syntax to import this file into a pandas DataFrame while ignoring the columns at index positions 1 and 2 (the points and rebounds columns):

import pandas as pd

#define columns to skip
skip_cols = [1, 2]

#define columns to keep
keep_cols = [i for i in range(4) if i not in skip_cols]

#import Excel file and skip specific columns
df = pd.read_excel('player_data.xlsx', usecols=keep_cols)

#view DataFrame
print(df)

  team  assists
0    A        5
1    B        3
2    C        7
3    D        8
4    E        8
5    F        9

Notice that every column in the Excel file except those at index positions 1 and 2 (the points and rebounds columns) was imported into the pandas DataFrame.

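Note that this positional approach assumes you know in advance how many columns the Excel file contains.

Since we knew the file had 4 columns in total, we used range(4) to define the columns to keep.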
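If the column count is not known ahead of time, one possible workaround is to read only the header row first and count its columns. The sketch below is not part of the original tutorial: the function names positions_to_keep and read_skipping_positions are hypothetical, and it relies on the nrows parameter of read_excel to limit the first read to the header.

```python
import pandas as pd


def positions_to_keep(total_cols, skip_cols):
    """Return the zero-based column positions that are not skipped."""
    return [i for i in range(total_cols) if i not in skip_cols]


def read_skipping_positions(path, skip_cols):
    """Read an Excel file while ignoring the given column positions.

    Hypothetical helper: the column count is taken from the header row
    instead of being hard-coded.
    """
    # nrows=0 asks pandas for the header only, with no data rows
    header = pd.read_excel(path, nrows=0)
    keep_cols = positions_to_keep(len(header.columns), skip_cols)

    # Second pass: read only the columns we decided to keep
    return pd.read_excel(path, usecols=keep_cols)
```

Calling read_skipping_positions('player_data.xlsx', [1, 2]) should then behave like the example above without hard-coding range(4), at the cost of opening the file twice.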

Note: The complete documentation for the pandas read_excel() function is available in the official pandas documentation.
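Besides a list of positions, the usecols parameter also accepts a callable, which pandas evaluates against each column name and uses to keep the columns for which it returns True. The sketch below skips columns by name instead of by position. The header names "points" and "rebounds" are assumptions based on the example file description, and keep_column and read_without_skipped are illustrative names.

```python
import pandas as pd

# Assumed header names of the columns to ignore
SKIP_NAMES = {"points", "rebounds"}


def keep_column(name):
    """Return True for column names that should be imported."""
    return name not in SKIP_NAMES


def read_without_skipped(path):
    """Read an Excel file, dropping any column listed in SKIP_NAMES."""
    # pandas calls keep_column once per column name in the header
    return pd.read_excel(path, usecols=keep_column)
```

This variant does not depend on the number or order of columns, but it does require the header names to match exactly.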

Additional Resources

The following tutorials explain how to perform other common tasks in pandas:

Pandas: How to Skip Rows When Reading an Excel File
Pandas: How to Specify dtypes When Importing an Excel File
Pandas: How to Combine Multiple Excel Sheets

