Question

I have an application which decides whether a human is handwaving,running or walking. The idea is i have segmented an action,say handwave,to its poses. Let s say

例;

for human1:pose7-pose3-pose7-..... represents handwave
for human3:pose1-pose7-pose1-..... represents handwave
for human7:pose1-pose1-pose7-..... represents handwave
for human20:pose3-pose7-pose7-..... represents handwave

for human1 pose11-pose33-pose77-..... represents walking
for human2 pose31-pose33-pose77-..... represents walking
for human3 pose11-pose77-pose77-..... represents walking
for human20 pose11-pose33-pose11-..... represents walking

i 用于在马特拉布培训SVER和NeuralNet。

现在,我测试了图像。再次重申我对测试图像进行了分类。

For the vector sizes of test and train sets in MATLAB; SVM and Neural Net requires same vector sizes. To make it work;
If I append 0 (assume it like pose0-which is an invalid pose) , to make sizes equal I have really good performance.
If I copy initial poses at the beginning and append them to the end until sizes are equal performance decreases.

例如:

train set: pose1-pose2-pose4-pose7-pose2-pose4-pose7
(1st method)test set: pose3-pose1-pose4-0-0-0-0 or
(2nd method)test set: pose3-pose1-pose4-pose3-pose1-pose4-pose3

由于所附的数值是实际的数值,因此,我预计采用第二种方法进行更好的分类。但pose0不是真实的。

Do you have any ideas ? Regards

Answer 1

就你而言,你的数据包括各有一系列特征的事例(如PoseSlot1、PoseSlot2、...、PoseSlotN)和类别值(直航、运行或行走)。

你们的问题是,所有班级的特征并不相同,例如,行走时有7个。

处理这类问题的标准方法是用missing Value标示这些空档,假定你的机器学习算法能够处理缺失的数值。

f1     f2    f3    f4    f5    f6    f7    class
-------------------------------------------------
pose1,pose2,pose4,pose7,pose2,pose4,pose7,running
pose3,pose1,pose4,    ?,    ?,    ?,    ?,walking

现在,你使用的第一个计算法是使用<代码>的简化标准?计算缺失值(相当于添加新的表示,表示缺失价值,而不是一个明确的<代码>?数值)。

重复价值的其他方式实际上造成了一个问题,而不是如果你认为是解决问题的。你们实际上创造了相关的特征,而且正如你所知,大多数机器学习算法最符合一套独立的特征(通常通过作为预处理步骤进行特征选择来解决)。

Answer 2

我不认为,从你的第一个方法中获得更好的业绩是不合理的。我假定,你指的是在更好的分类方面表现较好。我的推论是,交错序列通常较短。因此,当你填满“无效”时,通过将不同行为列为无效行为,比实际构成更容易加以区别。

友情链接