BinaryClassificationMetrics

Instance Constructors

new BinaryClassificationMetrics(scoreAndLabels: RDD[(Double, Double)])

Defaults numBins to 0.

Annotations
@Since( "1.0.0" )
new BinaryClassificationMetrics(scoreAndLabels: RDD[(Double, Double)], numBins: Int)

scoreAndLabels
an RDD of (score, label) pairs.
numBins
If greater than 0, the curves (ROC curve, PR curve) computed internally are down-sampled to this many "bins". If 0, no down-sampling occurs. Down-sampling is useful because a curve contains one point for each distinct score in the input, which can be as large as the input itself: millions of points or more, when thousands may be entirely sufficient to summarize the curve. After down-sampling, the curves consist of approximately numBins points. Points are made from bins of equal numbers of consecutive points. The size of each bin is floor(scoreAndLabels.count() / numBins), so the resulting number of bins may not exactly equal numBins. The last bin in each partition may be smaller as a result, meaning there may be an extra sample at partition boundaries.

Annotations
@Since( "1.3.0" )
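A minimal usage sketch of the two constructors, assuming a local Spark installation. The object name, the helper areaUnderRocBothWays, the toy (score, label) pairs, and the local[2] master are illustrative assumptions, not part of this API.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.mllib.evaluation.BinaryClassificationMetrics

object BinaryMetricsUsageSketch {
  /** Hypothetical helper: area under ROC for toy data, without and with numBins. */
  def areaUnderRocBothWays(sc: SparkContext): (Double, Double) = {
    // Toy (score, label) pairs; labels are 0.0 or 1.0.
    val scoreAndLabels = sc.parallelize(Seq((0.9, 1.0), (0.8, 0.0), (0.7, 1.0), (0.3, 0.0)))
    // One-argument constructor: numBins defaults to 0 (no down-sampling).
    val exact = new BinaryClassificationMetrics(scoreAndLabels)
    // Two-argument constructor: request roughly 100 bins.
    val binned = new BinaryClassificationMetrics(scoreAndLabels, 100)
    try {
      (exact.areaUnderROC(), binned.areaUnderROC())
    } finally {
      // Release the intermediate RDDs cached during the computation.
      exact.unpersist()
      binned.unpersist()
    }
  }

  def main(args: Array[String]): Unit = {
    val conf = new SparkConf().setAppName("BinaryMetricsUsageSketch").setMaster("local[2]")
    val sc = new SparkContext(conf)
    try println(areaUnderRocBothWays(sc)) finally sc.stop()
  }
}
```

With so few input points, the requested number of bins is far larger than the number of curve points, so the two results would be expected to agree; down-sampling only matters for large inputs.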

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def areaUnderPR(): Double

Computes the area under the precision-recall curve.

Annotations
@Since( "1.0.0" )
def areaUnderROC(): Double

Computes the area under the receiver operating characteristic (ROC) curve.

Annotations
@Since( "1.0.0" )
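To make "area under the curve" concrete, here is a plain-Scala sketch of the trapezoidal rule applied to a list of (x, y) curve points. This page does not state which integration rule the class uses, so treat the rule as an assumption; AreaUnderCurveSketch and trapezoidArea are hypothetical names.

```scala
object AreaUnderCurveSketch {
  /** Trapezoidal area under a curve given as (x, y) points ordered by x. */
  def trapezoidArea(points: Seq[(Double, Double)]): Double =
    points.sliding(2).collect {
      // Each pair of consecutive points contributes one trapezoid.
      case Seq((x1, y1), (x2, y2)) => (x2 - x1) * (y1 + y2) / 2.0
    }.sum
}
```

For example, AreaUnderCurveSketch.trapezoidArea(Seq((0.0, 0.0), (1.0, 1.0))) integrates the diagonal of the unit square, which corresponds to a classifier that ranks at random.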
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def fMeasureByThreshold(): RDD[(Double, Double)]

Returns the (threshold, F-Measure) curve with beta = 1.0.

Annotations
@Since( "1.0.0" )
def fMeasureByThreshold(beta: Double): RDD[(Double, Double)]

Returns the (threshold, F-Measure) curve.
beta
the beta factor in F-Measure computation.
returns
an RDD of (threshold, F-Measure) pairs.

Annotations
@Since( "1.0.0" )
See also
http://en.wikipedia.org/wiki/F1_score
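As a sketch of what the beta factor does, the standard F-measure definition can be written in plain Scala as below. FMeasureSketch and fMeasure are hypothetical names, and returning 0.0 when the denominator is zero is an assumption of this sketch rather than documented behavior.

```scala
object FMeasureSketch {
  /**
   * Standard F-beta score: (1 + beta^2) * P * R / (beta^2 * P + R).
   * beta = 1.0 weights precision and recall equally; beta > 1 favors recall.
   */
  def fMeasure(beta: Double, precision: Double, recall: Double): Double = {
    val beta2 = beta * beta
    val denominator = beta2 * precision + recall
    // Assumption of this sketch: report 0.0 instead of NaN when undefined.
    if (denominator == 0.0) 0.0
    else (1.0 + beta2) * precision * recall / denominator
  }
}
```

Applying such a function to the precision and recall at each threshold yields a list of (threshold, F-Measure) pairs of the kind this method returns.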
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
def initializeLogIfNecessary(isInterpreter: Boolean): Unit

Attributes
protected
Definition Classes
Logging
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def isTraceEnabled(): Boolean

Attributes
protected
Definition Classes
Logging
def log: Logger

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logDebug(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logError(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logInfo(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logName: String

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logTrace(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String, throwable: Throwable): Unit

Attributes
protected
Definition Classes
Logging
def logWarning(msg: ⇒ String): Unit

Attributes
protected
Definition Classes
Logging
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
val numBins: Int

If greater than 0, the curves (ROC curve, PR curve) computed internally are down-sampled to this many "bins". If 0, no down-sampling occurs. Down-sampling is useful because a curve contains one point for each distinct score in the input, which can be as large as the input itself: millions of points or more, when thousands may be entirely sufficient to summarize the curve. After down-sampling, the curves consist of approximately numBins points. Points are made from bins of equal numbers of consecutive points. The size of each bin is floor(scoreAndLabels.count() / numBins), so the resulting number of bins may not exactly equal numBins. The last bin in each partition may be smaller as a result, meaning there may be an extra sample at partition boundaries.

Annotations
@Since( "1.3.0" )
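The bin arithmetic described for numBins can be sketched in plain Scala. This only follows the rule as documented here (bin size is the floor of the point count divided by numBins, and each partition is chunked independently); BinSizeSketch, binSize, and resultingBins are hypothetical names, and the partition sizes are made-up inputs.

```scala
object BinSizeSketch {
  /** Bin size as documented: floor(totalPoints / numBins). */
  def binSize(totalPoints: Long, numBins: Int): Long = {
    require(numBins > 0, "numBins must be positive for down-sampling")
    totalPoints / numBins // integer division floors for non-negative values
  }

  /** Number of bins produced when each partition is chunked on its own. */
  def resultingBins(partitionSizes: Seq[Long], numBins: Int): Long = {
    val size = binSize(partitionSizes.sum, numBins)
    require(size >= 1, "fewer points than bins: nothing to down-sample")
    // Ceiling division: the last chunk of a partition may be smaller.
    partitionSizes.map(n => (n + size - 1) / size).sum
  }
}
```

With 1000 points spread over partitions of 333, 333, and 334 points and numBins = 100, the bin size is 10 and each partition yields 34 bins, so the curve ends up with 102 points rather than exactly 100.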
def pr(): RDD[(Double, Double)]

Returns the precision-recall curve, which is an RDD of (recall, precision), NOT (precision, recall), with (0.0, 1.0) prepended to it.

Annotations
@Since( "1.0.0" )
See also
http://en.wikipedia.org/wiki/Precision_and_recall
def precisionByThreshold(): RDD[(Double, Double)]

Returns the (threshold, precision) curve.

Annotations
@Since( "1.0.0" )
def recallByThreshold(): RDD[(Double, Double)]

Returns the (threshold, recall) curve.

Annotations
@Since( "1.0.0" )
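The by-threshold curves and the precision-recall curve above can be illustrated on an in-memory collection. The sketch below is plain Scala rather than the RDD-based implementation, and it assumes that a label greater than 0.5 counts as positive and that a point is predicted positive when its score is at least the threshold; ThresholdCurvesSketch and its methods are hypothetical names.

```scala
object ThresholdCurvesSketch {
  /** (threshold, precision, recall) per distinct score, thresholds descending. */
  def byThreshold(scoreAndLabels: Seq[(Double, Double)]): Seq[(Double, Double, Double)] = {
    val totalPositives = scoreAndLabels.count(_._2 > 0.5)
    val thresholds = scoreAndLabels.map(_._1).distinct.sortWith(_ > _)
    thresholds.map { t =>
      // Everything scored at or above the threshold is predicted positive.
      val predicted = scoreAndLabels.filter(_._1 >= t)
      val truePositives = predicted.count(_._2 > 0.5)
      val precision = truePositives.toDouble / predicted.size
      val recall = if (totalPositives == 0) 0.0 else truePositives.toDouble / totalPositives
      (t, precision, recall)
    }
  }

  /** PR curve as (recall, precision) pairs with (0.0, 1.0) prepended. */
  def prCurve(scoreAndLabels: Seq[(Double, Double)]): Seq[(Double, Double)] =
    Seq((0.0, 1.0)) ++ byThreshold(scoreAndLabels).map { case (_, p, r) => (r, p) }
}
```

Note the order of each pair in prCurve: recall first, precision second, matching the (recall, precision) convention stated for pr().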
def roc(): RDD[(Double, Double)]

Returns the receiver operating characteristic (ROC) curve, which is an RDD of (false positive rate, true positive rate) with (0.0, 0.0) prepended and (1.0, 1.0) appended to it.

Annotations
@Since( "1.0.0" )
See also
http://en.wikipedia.org/wiki/Receiver_operating_characteristic
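A plain-Scala sketch of how (false positive rate, true positive rate) points with the two fixed endpoints could be built from (score, label) pairs. It works on an in-memory Seq instead of an RDD, and it assumes labels greater than 0.5 are positive and that both classes are present; RocSketch and rocPoints are hypothetical names.

```scala
object RocSketch {
  /** ROC points (FPR, TPR) with (0.0, 0.0) prepended and (1.0, 1.0) appended. */
  def rocPoints(scoreAndLabels: Seq[(Double, Double)]): Seq[(Double, Double)] = {
    val positives = scoreAndLabels.count(_._2 > 0.5)
    val negatives = scoreAndLabels.size - positives
    require(positives > 0 && negatives > 0, "both classes must be present")
    // One candidate threshold per distinct score, highest first.
    val thresholds = scoreAndLabels.map(_._1).distinct.sortWith(_ > _)
    val points = thresholds.map { t =>
      val predicted = scoreAndLabels.filter(_._1 >= t)
      val truePositives = predicted.count(_._2 > 0.5)
      val falsePositives = predicted.size - truePositives
      (falsePositives.toDouble / negatives, truePositives.toDouble / positives)
    }
    Seq((0.0, 0.0)) ++ points ++ Seq((1.0, 1.0))
  }
}
```

The lowest threshold already classifies every point as positive, so the appended (1.0, 1.0) duplicates the last computed point in this sketch; the duplicate adds no area under the curve.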
val scoreAndLabels: RDD[(Double, Double)]

an RDD of (score, label) pairs.

Annotations
@Since( "1.3.0" )
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def thresholds(): RDD[Double]

Returns thresholds in descending order.

Annotations
@Since( "1.0.0" )
def toString(): String

Definition Classes
AnyRef → Any
def unpersist(): Unit

Unpersist intermediate RDDs used in the computation.

Annotations
@Since( "1.0.0" )
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

Related Doc: package evaluation

class BinaryClassificationMetrics extends Logging
