python决策树sklearn_python – 获取sklearn中节点的决策路径

100 阅读 0 评论 66 点赞

我是靠谱客的博主风趣云朵，最近开发中收集的这篇文章主要介绍python决策树sklearn_python – 获取sklearn中节点的决策路径，觉得挺不错的，现在分享给大家，希望可以做个参考。

概述

对于使用iris数据集的节点的决策规则：

from sklearn.datasets import load_iris

from sklearn import tree

import graphviz

iris = load_iris()

clf = tree.DecisionTreeClassifier()

clf = clf.fit(iris.data, iris.target)

dot_data = tree.export_graphviz(clf, out_file=None,

feature_names=iris.feature_names,

class_names=iris.target_names,

filled=True, rounded=True,

special_characters=True)

graph = graphviz.Source(dot_data)

#this will create an iris.pdf file with the rule path

graph.render("iris")

对于基于样本的路径,请使

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.datasets import load_iris

from sklearn.tree import DecisionTreeClassifier

iris = load_iris()

X = iris.data

y = iris.target

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

estimator = DecisionTreeClassifier(max_leaf_nodes=3, random_state=0)

estimator.fit(X_train, y_train)

# The decision estimator has an attribute called tree_ which stores the entire

# tree structure and allows access to low level attributes. The binary tree

# tree_ is represented as a number of parallel arrays. The i-th element of each

# array holds information about the node `i`. Node 0 is the tree's root. NOTE:

# Some of the arrays only apply to either leaves or split nodes, resp. In this

# case the values of nodes of the other type are arbitrary!

# Among those arrays, we have:

# - left_child, id of the left child of the node

# - right_child, id of the right child of the node

# - feature, feature used for splitting the node

# - threshold, threshold value at the node

n_nodes = estimator.tree_.node_count

children_left = estimator.tree_.children_left

children_right = estimator.tree_.children_right

feature = estimator.tree_.feature

threshold = estimator.tree_.threshold

# The tree structure can be traversed to compute various properties such

# as the depth of each node and whether or not it is a leaf.

node_depth = np.zeros(shape=n_nodes, dtype=np.int64)

is_leaves = np.zeros(shape=n_nodes, dtype=bool)

stack = [(0, -1)] # seed is the root node id and its parent depth

while len(stack) > 0:

node_id, parent_depth = stack.pop()

node_depth[node_id] = parent_depth + 1

# If we have a test node

if (children_left[node_id] != children_right[node_id]):

stack.append((children_left[node_id], parent_depth + 1))

stack.append((children_right[node_id], parent_depth + 1))

else:

is_leaves[node_id] = True

print("The binary tree structure has %s nodes and has "

"the following tree structure:"

% n_nodes)

for i in range(n_nodes):

if is_leaves[i]:

print("%snode=%s leaf node." % (node_depth[i] * "", i))

else:

print("%snode=%s test node: go to node %s if X[:, %s] <= %s else to "

"node %s."

% (node_depth[i] * "",

children_left[i],

feature[i],

threshold[i],

children_right[i],

))

print()

# First let's retrieve the decision path of each sample. The decision_path

# method allows to retrieve the node indicator functions. A non zero element of

# indicator matrix at the position (i, j) indicates that the sample i goes

# through the node j.

node_indicator = estimator.decision_path(X_test)

# Similarly, we can also have the leaves ids reached by each sample.

leave_id = estimator.apply(X_test)

# Now, it's possible to get the tests that were used to predict a sample or

# a group of samples. First, let's make it for the sample.

# HERE IS WHAT YOU WANT

sample_id = 0

node_index = node_indicator.indices[node_indicator.indptr[sample_id]:

node_indicator.indptr[sample_id + 1]]

print('Rules used to predict sample %s: ' % sample_id)

for node_id in node_index:

if leave_id[sample_id] == node_id: # <-- changed != to ==

#continue # <-- comment out

print("leaf node {} reached, no decision here".format(leave_id[sample_id])) # <--

else: # < -- added else to iterate through decision nodes

if (X_test[sample_id, feature[node_id]] <= threshold[node_id]):

threshold_sign = "<="

else:

threshold_sign = ">"

print("decision id node %s : (X[%s, %s] (= %s) %s %s)"

% (node_id,

sample_id,

feature[node_id],

X_test[sample_id, feature[node_id]], # <-- changed i to sample_id

threshold_sign,

threshold[node_id]))

这将在最后打印以下内容：

用于预测样本0的规则：

决策ID节点0：(X [0,3](= 2.4)> 0.800000011920929)

决策ID节点2：(X [0,2](= 5.1)> 4.949999809265137)

叶节点4到达了,这里没有决定

最后

以上就是风趣云朵为你收集整理的python决策树sklearn_python – 获取sklearn中节点的决策路径的全部内容，希望文章能够帮你解决python决策树sklearn_python – 获取sklearn中节点的决策路径所遇到的程序开发问题。

如果觉得靠谱客网站的内容还不错，欢迎将靠谱客网站推荐给程序员好友。

本图文内容来源于网友提供，作为学习参考使用，或来自网络收集整理，版权属于原作者所有。

本文分类：python决策树sklearn
浏览次数：100 次浏览
发布日期：2023-06-22 10:43:01
本文链接：https://www.kaopuke.com/article/k-p-k_14_uzo_26_fz_12__23__6_y.html

verilog adc代码_HDLBits：在线学习 Verilog （十八 · Problem 85-89）

数电四：触发器1、基本RS触发器2、时钟控制R-S触发器

飞行机器人专栏（八）-- AGX Xavier 通信、控制及视觉应用开发0. Introduction of Jetson Developer kits1. Introduction of AGX Xavier2. NVIDIA SDK manager and Jetpack 3. rootfs eMMC to SSDhttps://github.com/jetsonhacks/rootOnNVMe/issues/21https://github.com/jetsonhacks/rootOn

python决策树sklearn_python – 获取sklearn中节点的决策路径

概述

最后

评论列表共有 0 条评论

发表评论取消回复

python决策树sklearn_python – 获取sklearn中节点的决策路径

概述

最后

相关文章

评论列表共有 0 条评论

发表评论 取消回复

发表评论取消回复