Editing
Stop Word
Jump to navigation
Jump to search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
Stop Word (單一高頻字、停用字、停止字串) English: a, of, the, in, is, she, he, to be, as, because, if, when Chinese(Traditional): 的 一 是 不 人 在 有 我 了 中 ... 這 個 來 為 國 們 著 時 會 說 Chinese(Simplified): 的 一 是 不 人 在 有 我 了 中 ... 这 个 来 为 国 们 着 时 会 说 use case * search the gmail inclue "这 OR 个 OR 来 OR 为 OR 国 OR 们 OR 时 OR 说" for deleting spam letters in Chinese(Simplified) download file * [https://github.com/stopwords-iso/stopwords-zh/tree/master stopwords-iso/stopwords-zh: Chinese stopwords collection] references * [http://www.google.com.tw/intl/zh-TW/insidesearch/tipstricks/all.html#characters 搜尋提示及秘訣 – 搜尋主頁 – Google] "在搜尋中包含或略過特定字詞及字元 如果某些常見字詞及字元 (例如「the」和「&」) 對您的搜尋至關重要,請在其前後加上英文引號,例如電影或書名中的「the」可標示為「"the"」。 ... ... " {{access | date = 2015-02-15}} * [http://humanum.arts.cuhk.edu.hk/Lexis/chifreq/ Chinese Character Frequency Statistics for Hong Kong, Mainland China and Taiwan - A Trans-Regional, Diachronic Survey]: 香港、大陸、台灣 - 跨地區、跨年代漢語常用字頻統計 {{access | date = 2015-11-24}} * [http://www.ranks.nl/stopwords Stopwords] "Collection of stopword lists in many languages." {{access | date = 2015-11-24}} * [https://en.wikipedia.org/wiki/Stop_words Stop words - Wikipedia] {{access | date = 2016-11-14}} * Adobe (n.d.). [https://helpx.adobe.com/experience-manager/kb/Stopwordlist.html Optimize search by adding stop words] {{access | date = 2017-06-22}} 提供德國 (de)、英文 (en)、西班牙 (es)、法國 (fr)、荷蘭 (nl)、瑞典 (se) 語言的停用字。 * [http://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.CountVectorizer.html sklearn.feature_extraction.text.CountVectorizer — scikit-learn 0.19.1 documentation] {{access | date = 2018-04-14}} * 中文停用字: [https://github.com/zake7749/word2vec-tutorial/blob/master/jieba_dict/stopwords.txt word2vec-tutorial/stopwords.txt at master · zake7749/word2vec-tutorial · GitHub] [[Category:NLP]] [[Category:Search]]
Summary:
Please note that all contributions to LemonWiki共筆 are considered to be released under the Creative Commons Attribution-NonCommercial-ShareAlike (see
LemonWiki:Copyrights
for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource.
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Template used on this page:
Template:Access
(
view source
) (protected)
Navigation menu
Personal tools
Not logged in
Talk
Contributions
Log in
Namespaces
Page
Discussion
English
Views
Read
Edit
View history
More
Search
Navigation
Main page
Current events
Recent changes
Random page
Help
Categories
Tools
What links here
Related changes
Special pages
Page information