R でデシジョンツリーを描画する方法 (例付き)

によるベンジャミン・アンダーソン博士 7月 17, 2023 ガイド 0コメント

機械学習において、デシジョンツリーは、一連の予測子変数を使用して応答変数の値を予測するデシジョンツリーを作成するモデルの一種です。

R でデシジョンツリーをプロットする最も簡単な方法は、 rpart.plotパッケージのprp()関数を使用することです。

次の例は、この関数を実際に使用する方法を示しています。

例: R で決定木を描画する

この例では、 ISLRパッケージのHittersデータセットを使用します。これには、263 人のプロ野球選手に関するさまざまな情報が含まれています。

このデータセットを使用して、ホームランと出場年数を使用して特定の選手の年俸を予測する回帰ツリーを構築します。

次のコードは、この回帰ツリーを当てはめる方法と、 prp()関数を使用してツリーを描画する方法を示しています。

 library (ISLR)
library (rpart)
library (rpart.plot)

#build the initial decision tree
tree <- rpart(Salary ~ Years + HmRun, data=Hitters, control=rpart. control (cp= .0001 ))

#identify best cp value to use
best <- tree$cptable[which. min (tree$cptable[," xerror "])," CP "]

#produce a pruned tree based on the best cp value
pruned_tree <- prune (tree, cp=best)

#plot the pruned tree
prp(pruned_tree)

prp () 関数のfaclen 、 extra 、 roundintおよびDigits引数を使用して、デシジョンツリーの外観をカスタマイズすることもできることに注意してください。

 #plot decision tree using custom arguments
prp(pruned_tree,
    faclen= 0 , #use full names for factor labels
    extra= 1 , #display number of observations for each terminal node
    roundint= F , #don't round to integers in output
    digits= 5 ) #display 5 decimal places in output

Rで決定木を描く