English 中文(简体)
自然语言
原标题:Natural language de-identification
  • 时间:2012-01-12 23:21:29
  •  标签:
  • nlp

我正在寻找一种能够自动确定英文文本的自然语言工具。 例如,每个电子邮件地址都应改称或模糊。 但是,应当对适当的名称进行非识别,并处理不了。

http://mist-deid.sourceforge.net/“rel=“nofollow” MITRE ID Scrubber Toolkit。 我不知道它是如何运作的。

我的问题:

  • Are there any other tools out there?
  • Does anyone have experience with the MITRE tool? How well does it work?

感谢。

最佳回答

身份查验(也许更经常地称为anonymization)是一个非常积极的研究领域,因为其成功显然是在《国家扫盲计划》等领域使用真正的文本公司进行保健、药品等。 我建议你研究在。 如果你进一步跟踪这些联系,你将找到研究论文,说明这些工具如何进一步参考和成果评价。

问题回答

暂无回答




相关问题
Java Stanford NLP: Part of Speech labels?

The Stanford NLP, demo d here, gives an output like this: Colorless/JJ green/JJ ideas/NNS sleep/VBP furiously/RB ./. What do the Part of Speech tags mean? I am unable to find an official list. Is it ...

Java Stanford NLP: Find word frequency?

I m using the Stanford NLP Parsing toolkit. Given a word in the lexicon, how can I find its frequency*? Or, given a frequency rank, how can I determine the corresponding word? *in the entire language,...

c/c++ NLP library [closed]

I am looking for an open source Natural Language Processing library for c/c++ and especially i am interested in Part of speech tagging.

Clustering text in Python [closed]

I need to cluster some text documents and have been researching various options. It looks like LingPipe can cluster plain text without prior conversion (to vector space etc), but it s the only tool I ...

Natural language rendering

Do you know any frameworks that implement natural language rendering concept ? I ve found several NLP oriented frameworks like Anthelope or Open NLP but they have only parsers but not renderers or ...

热门标签