library( ”randomForest” )
training <-read.csv("d:/Users/Lenovo-pc/Documents/2013data1.csv",header=TRUE) /导入数据/
ind <-sample(2,nrow(training),replace=TRUE,prob=c(0.7,0.3)) /对数据分成两部分,70%训练数据,30%检测数据/
traindata<- training [ind==1,]
testdata<- training [ind==2,]
set.seed(100)
rf<-randomForest(oneyear~.,data=training,training[ind==1,],ntree=100,nPerm=10,mtry=3,proximity=TRUE,importance=TRUE)
这里显示 Error in na.fail.default(list(oneyear = c(1L, 1L, 1L, 1L, 1L, 1L, 1L, :
missing values in object
table(predict(rf),training$id)
print(rf)
plot(rf)
importance(rf)
还有我想问一下,oneyear是最终的分类结果,那么分类的标准,也就是变量是怎么确定的呢?oneyear在行还是列?
请问
显示 Error in na.fail.default(list(oneyear = c(1L, 1L, 1L, 1L, 1L, 1L, 1L, :
missing values in object
是什么意思?
代码感觉是对的