我最近用我的一份电子邮件同我的主管一道,任期为一年。 自2006年以来 我正在对面板用户情况进行数据清点项目,他说,我正在收集深入真实的数据。
我对这个词非常新,我对这个词进行了在线查询,但从数据污染的角度看,我发现这方面的成果很少。
谁能给我一个例子,说明这种严酷的数据在数据清醒的任务中究竟是什么?
非常感谢。
我最近用我的一份电子邮件同我的主管一道,任期为一年。 自2006年以来 我正在对面板用户情况进行数据清点项目,他说,我正在收集深入真实的数据。
我对这个词非常新,我对这个词进行了在线查询,但从数据污染的角度看,我发现这方面的成果很少。
谁能给我一个例子,说明这种严酷的数据在数据清醒的任务中究竟是什么?
非常感谢。
Ground-truth is data annotated (generally by human) known to be sure at 100%. It s used to train algorithm since it s what you expect the algorithm to give you.
Which forums you are using for data mining questions? SO is mainly intended for programming, not for DM questions.
I m working on a project at the moment where I need to pick out the most common phrases in a huge body of text. For example say we have three sentences like the following: The dog jumped over the ...
I d like to find patterns and sort them by number of occurrences on an HEX file I have. I am not looking for some specific pattern, just to make some statistics of the occurrences happening there and ...
Does anybody know about any dataming libraries for .net?
I m planning to develop program in Java which will provide diagnosis. The data set is divided into two parts one for training and the other for testing. My program should learn to classify from the ...
I m using the explorer feature of Weka for classification. So I have my .arff file, with 2 features of NUMERIC value, and my class is a binary 0 or 1 (eg {0,1}). Sample: @RELATION summary @...
I ve got a somewhat ugly field in a database which holds the names of locations. For instance, Madison Square Gardens which has also been entered as "The Madison Square Gardens", etc. etc. I m ...
This is intended as a question quite open to any suggestions, hints or pointers. I wish to start playing around with home brewed automated investment models, the beginnings of which I have concepts ...