I m 为html网页安装一个网络报废器。 问题是背景关系,因为我需要决定一线内容与非行内容之间的关系,因此我可以说,这些内容是相关的,还是不是背景观点:
页: 1
$str1 = "president obama visited Barcelona yesterday"; //politics context
$str2 = "Barcelona was defeated from Chelsea yesterday"; //sports context
页: 1
$str3 = "Obama s appearance on Late Night With Jimmy Fallon "; //media context
$str4 = "Late Night show with jimmy fallon"; //mdeia context
第一例
<>strong>$1 and $str2 are大体不同,因此,关系可能为10% 或
第二例
<>strong>$str3 and $str4 are in the same context (media)beit$3 about President obama and the $4 about Ji Fallon but both are related to end road show,So relation may be 90%
I m using the Porter-Stemmer algorithm to remove the common endings from words. What to do next?