我想计算每行出现在一列(“Comment”)的每一行中的每个单词的数量,并将其存储在名为headlight的数据框中的新列(“word”)。
我正在尝试下面的代码,但是,我得到和错误。在for i in range(0,len(headlamp)):
headlamp['word'].apply(lambda text: Counter(" ".join(headlamp['Comment'][i].astype(str)).split(" ")).items())
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
in ()
1 for i in range(0,len(headlamp)):
----> 2 headlamp['word'].apply(lambda text: Counter("".join(headlamp['Comment'][i].astype(str)).split(" ")).items())
C:UsersRafaelAnaconda2envsgl-envlibsite-packagespandascoreframe.pyc in __getitem__(self, key)
1995 return self._getitem_multilevel(key)
1996 else:
-> 1997 return self._getitem_column(key)
1998
1999 def _getitem_column(self, key):
C:UsersRafaelAnaconda2envsgl-envlibsite-packagespandascoreframe.pyc in _getitem_column(self, key)
2002 # get column
2003 if self.columns.is_unique:
-> 2004 return self._get_item_cache(key)
2005
2006 # duplicate columns & possible reduce dimensionality
C:UsersRafaelAnaconda2envsgl-envlibsite-packagespandascoregeneric.pyc in _get_item_cache(self, item)
1348 res = cache.get(item)
1349 if res is None:
-> 1350 values = self._data.get(item)
1351 res = self._box_item_values(item, values)
1352 cache[item] = res
C:UsersRafaelAnaconda2envsgl-envlibsite-packagespandascoreinternals.pyc in get(self, item, fastpath)
3288
3289 if not isnull(item):
-> 3290 loc = self.items.get_loc(item)
3291 else:
3292 indexer = np.arange(len(self.items))[isnull(self.items)]
C:UsersRafaelAnaconda2envsgl-envlibsite-packagespandasindexesbase.pyc in get_loc(self, key, method, tolerance)
1945 return self._engine.get_loc(key)
1946 except KeyError:
-> 1947 returnself._engine.get_loc(self._maybe_cast_indexer(key))
1948
1949 indexer = self.get_indexer([key], method=method, tolerance=tolerance)
pandasindex.pyx in pandas.index.IndexEngine.get_loc (pandasindex.c:4154)()
pandasindex.pyx in pandas.index.IndexEngine.get_loc (pandasindex.c:4018)()
pandashashtable.pyx in pandas.hashtable.PyObjectHashTable.get_item (pandashashtable.c:12368)()
pandashashtable.pyx in pandas.hashtable.PyObjectHashTable.get_item (pandashashtable.c:12322)()
KeyError: 'word'
任何帮助都将不胜感激
最后
以上就是过时硬币最近收集整理的关于python如何保存计算数据_Python:计算一列中单词的频率,并将结果存储到我的数据fram上的另一列中...的全部内容,更多相关python如何保存计算数据_Python:计算一列中单词内容请搜索靠谱客的其他文章。
发表评论 取消回复