Difference between revisions of "Named entity recognition tools"


Revision as of 09:50, 9 April 2020

Named entity recognition (NER) is also known as entity recognition or proper-noun recognition (Chinese: 命名實體識別, 實體識別, 專有名詞辨識).

CKIP Neural Chinese Word Segmentation, POS Tagging, and NER

ckiplab/ckiptagger: CKIP Neural Chinese Word Segmentation, POS Tagging, and NER

Class name in English | Class name in Traditional Chinese
person | 人名
norp | 團體
FAC | 設施
facility | 設施*
ORG | 組織
organization | 組織*
gpe | 地理
LOC | 地點
location | 地點*
product | 商品
event | 事件
WORK | 藝術品
work of art | 藝術品*
law | 法律
language | 語言
date | 日期
time | 時間
percent | 比例
money |
quantity | 數量
ordinal | 序數
cardinal | 數詞
Note: the wildcard symbol (*) marks a class that has a different name in English but shares its Chinese name with another class in the table.
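The table above can be sketched as a lookup that treats the abbreviated and spelled-out English names as aliases of one Chinese class. This is only an illustration of the table, not part of ckiptagger; the names `CLASS_NAME_ZH`, `to_chinese`, and `same_chinese_class` are hypothetical, and `money` is left out because the table gives no Chinese name for it.

```python
# Class-name table transcribed from the section above.
# Keys are lowercased so that lookups are case-insensitive.
CLASS_NAME_ZH = {
    "person": "人名",
    "norp": "團體",
    "fac": "設施",
    "facility": "設施",
    "org": "組織",
    "organization": "組織",
    "gpe": "地理",
    "loc": "地點",
    "location": "地點",
    "product": "商品",
    "event": "事件",
    "work": "藝術品",
    "work of art": "藝術品",
    "law": "法律",
    "language": "語言",
    "date": "日期",
    "time": "時間",
    "percent": "比例",
    "quantity": "數量",
    "ordinal": "序數",
    "cardinal": "數詞",
}


def to_chinese(label):
    """Return the Traditional Chinese class name, or None if it is not in the table."""
    # Assumption: underscores are treated as spaces, so WORK_OF_ART matches "work of art".
    key = label.strip().lower().replace("_", " ")
    return CLASS_NAME_ZH.get(key)


def same_chinese_class(label_a, label_b):
    """True when two English class names map to the same Chinese class name."""
    zh_a, zh_b = to_chinese(label_a), to_chinese(label_b)
    return zh_a is not None and zh_a == zh_b
```

For example, `same_chinese_class("ORG", "organization")` is true because both rows map to 組織, which is the situation the asterisk marks.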

Stanford CoreNLP

Stanford CoreNLP – Natural language software | Stanford CoreNLP

  • license: GNU General Public License v3 Good!
  • language support:
  • programming language: Java
  • classes of entity: "For English, by default, this annotator recognizes named (PERSON, LOCATION, ORGANIZATION, MISC), numerical (MONEY, NUMBER, ORDINAL, PERCENT), and temporal (DATE, TIME, DURATION, SET) entities (12 classes). [2]"
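The twelve default English classes quoted above fall into three groups. A minimal sketch of that grouping as plain Python data follows; it does not call CoreNLP itself, and the names `CORENLP_ENGLISH_CLASSES` and `entity_group` are hypothetical.

```python
# Default English entity classes, grouped as in the quoted CoreNLP description.
CORENLP_ENGLISH_CLASSES = {
    "named": ("PERSON", "LOCATION", "ORGANIZATION", "MISC"),
    "numerical": ("MONEY", "NUMBER", "ORDINAL", "PERCENT"),
    "temporal": ("DATE", "TIME", "DURATION", "SET"),
}


def entity_group(label):
    """Return 'named', 'numerical', or 'temporal' for a class label, else None."""
    wanted = label.strip().upper()
    for group, labels in CORENLP_ENGLISH_CLASSES.items():
        if wanted in labels:
            return group
    return None


# Total number of classes across the three groups.
TOTAL_CLASSES = sum(len(labels) for labels in CORENLP_ENGLISH_CLASSES.values())
```

`TOTAL_CLASSES` evaluates to 12, matching the count given in the quote.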

spaCy

spaCy · Industrial-strength Natural Language Processing in Python

  • license: MIT License Good!
  • language support:
  • programming language: Python
  • classes of entity: "PERSON, NORP, FAC, ORG, GPE, LOC, PRODUCT, EVENT, WORK_OF_ART, LAW, LANGUAGE, DATE, TIME, PERCENT, MONEY, QUANTITY, ORDINAL and CARDINAL [3]"
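A short sketch of how recognized entities might be grouped by the 18 labels listed above. It works on plain `(text, label)` pairs so that it runs without spaCy installed; `SPACY_LABELS` and `group_by_label` are hypothetical names, and the sample pairs are hand-written for illustration rather than actual model output.

```python
from collections import defaultdict

# The 18 entity labels quoted above.
SPACY_LABELS = frozenset({
    "PERSON", "NORP", "FAC", "ORG", "GPE", "LOC", "PRODUCT", "EVENT",
    "WORK_OF_ART", "LAW", "LANGUAGE", "DATE", "TIME", "PERCENT", "MONEY",
    "QUANTITY", "ORDINAL", "CARDINAL",
})


def group_by_label(entities):
    """Group (text, label) pairs into a dict mapping label -> list of texts."""
    grouped = defaultdict(list)
    for text, label in entities:
        if label not in SPACY_LABELS:
            raise ValueError(f"unknown entity label: {label!r}")
        grouped[label].append(text)
    return dict(grouped)


# With spaCy installed and a pipeline loaded as `nlp`, the pairs could come from:
#   [(ent.text, ent.label_) for ent in nlp(text).ents]
# The pairs below are hand-written for illustration only.
sample = [("Apple", "ORG"), ("U.K.", "GPE"), ("$1 billion", "MONEY")]
grouped_sample = group_by_label(sample)
```

Validating labels against the fixed set makes it obvious when a pipeline emits a class outside this list, for example a model trained on a different annotation scheme.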

Google Cloud Natural Language

Cloud Natural Language  |  Cloud Natural Language API  |  Google Cloud

Amazon Comprehend

Amazon Comprehend – Natural Language Processing (NLP) and Machine Learning (ML)

  • license:
  • language support:
  • programming language:
  • classes of entity: "COMMERCIAL_ITEM, DATE, EVENT, LOCATION, ORGANIZATION, OTHER, PERSON, QUANTITY and TITLE"[4]

IBM Watson

Watson Natural Language Understanding

  • license:
  • language support:
  • programming language:
  • classes of entity: "Date, Duration, EmailAddress, Facility, GeographicFeature, Hashtag, IPAddress, JobTitle, Location and more ..."[5]

卓騰語言科技 Chinese word segmentation

卓騰語言科技 Chinese word segmentation API

  • license:
  • language support: Traditional Chinese
  • programming language:
  • classes of entity: "person, location, time, measurement and more ... [6]"

BosonNLP

BosonNLP

  • license:
  • language support: Simplified Chinese
  • programming language: multiple
  • classes of entity: "time, location, person_name, org_name, company_name, product_name and job_title [7]"
Class name in English | Class name in Simplified Chinese | Class name in Traditional Chinese
time | 时间 | 時間
location | 地点 | 地點
person_name | 人名 | 人名
org_name | 组织名 | 組織名
company_name | 公司名 | 公司名
product_name | 产品名 | 產品名
job_title | 职位 | 職位

Baidu AI Open Platform (百度AI开放平台)

Basic language processing technology - Baidu AI Open Platform, "proper noun recognition" (专名识别) [8]

  • license:
  • language support: Simplified Chinese
  • programming language: multiple
  • classes of entity:
Class name in English (abbreviation) | Class name in Simplified Chinese | Class name in Traditional Chinese
PER | 人名 | 人名
LOC | 地名 | 地名
ORG | 机构名 | 機構名
TIME | 时间 | 時間

Other NER tools

References