Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
286 views
in Technique[技术] by (71.8m points)

weka - Trained and Test data have different number of attributes that gave an error "Train and test set are not compatible"

I use WEKA for Text classification , I have trained data set , and I apply StringToWOrdVector and NumericToNominal filters , and have test data set and applied the same filters on it . When I try to apply my model on test data ,it gave me the following error Train and test set are not compatible I searched for a solution , the error occurred because number of attributes different between two sets, and it always be different because texts in two sets are different

How I can solve this error please ?

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Answer

0 votes
by (71.8m points)

The best thing you can do is combine your training and test set into one file and then apply the filter to it all in one go, then split them up again and copy the @attribute values from the combined file into both the training and test files. This way the attributes will be consistent across both files.


与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

2.1m questions

2.1m answers

60 comments

56.8k users

...