<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom"><title type="text">博客园_码农.KEN的园子_分类_算法/理论</title><id>http://feed.cnblogs.com/blog/u/30096/category/250652/rss</id><updated>2012-06-03T01:37:25Z</updated><generator>feed.cnblogs.com</generator><link rel="alternate" type="text/html" href="http://www.cnblogs.com/ken-zhang/category/250652.html"/><link rel="self" type="application/atom+xml" href="http://feed.cnblogs.com/blog/u/30096/category/250652/rss"/><entry><id>http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761111.html</id><title type="text">【转】TF-IDF算法扫盲2</title><summary type="text">本文转载自http://www.mryang.org/logs/45675845.htmlTF-IDF算法是一种简单快捷的文档特征词抽取方法，通过统计文档中的词频来对文档进行主题分类。TF-IDF(term frequency–inverse document frequency)是一种统计方法，用以评估一字词对于一个文件集或一个语料库中的其中一份文件的重要程度。字词的重要性随着它在文...</summary><published>2010-06-19T16:25:00Z</published><updated>2010-06-19T16:25:00Z</updated><author><name>码农.ＫＥＮ</name><uri>http://www.cnblogs.com/ken-zhang/</uri></author><link rel="alternate" href="http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761111.html"/><link rel="alternate" type="text/html" href="http://www.cnblogs.com/ken-zhang/archive/2010/06/20/1761111.html"/><content type="html"/></entry></feed>
