(可视化模型表现)Visualizing Model Performance

来源:互联网 发布:如何设计一个软件 编辑:程序博客网 时间:2024/05/16 19:54
  • Stakeholders outside of the data science team may have little patience for details, and will often want a higher-level, more intuitive view of model performance.

     

  •  It is important for the data scientist to realize that clear communication with key stakeholders is not only a primary goal of her job, but also is essential for doing the right modeling (in addition to doing the modeling right).

     

    ————————————————————————————————————————————————

     

    主要内容:

    1. Profit curves
    1. ROC  graph (receiver operating characteristic)
      • AUC graph (The area under the ROC Curve)
    2. Cumulative response and lift curve

     

    ————————————————————————————————————————————————

     

    (一)Profit curve

     

     

    The disadvantage of a profit graph is that it requires that operating conditions be known and specified exactly. With many real-world problems, the operating conditions are imprecise or change over time, and the data scientist must contend with uncertainty.

     

     

    (二)ROC graph

     

    在先验分布和损益不清楚的情况下,我们可以依据不同情况依次建立模型绘制Profit graph,但这显然不能让人满意

     

    为解决这个问题,我们可以将各种情况的模型表现都绘制在一张图中,其中一种方法便是 Receiver Operating Characteristics (ROC) graph

     

     

     

    ROC graph 的详细产生过程见下图

     

     

    ROC数据点的计算方式

     

    Confusion Matrix

     



    P



    N


    Y


    a



    b



    N



    c



    d

     

    ROC  Point

     

     

    An advantage of ROC graphs is that they decouple classifier performance from the conditions under which the classifiers will be used. Specifically, they are independent of the class proportions as well as the costs and benefits.

     

     

    The area under the RCO  curve (AUC)

    • Though a ROC curve provides more information than its area, the AUC is useful when a single number is needed to summarize performance, or when nothing is known about the operating conditions.

     

     

    (三)Cumulative response and lift curve

     

0 0