On Asymptotic Distributions and Confidence Intervals for LIFT Measures in Data Mining

Jiang, Wenxin; Zhao, Yu

On Asymptotic Distributions and Confidence Intervals for LIFT Measures in Data Mining

2015

Jiang, Wenxin | Zhao, Yu

A LIFT measure, such as the response rate, lift, or the percentage of captured response, is a fundamental measure of effectiveness for a scoring rule obtained from data mining, which is estimated from a set of validation data. In this article, we study how to construct confidence intervals of the LIFT measures. We point out the subtlety of this task and explain how simple binomial confidence intervals can have incorrect coverage probabilities, due to omitting variation from the sample percentile of the scoring rule. We derive the asymptotic distribution using some advanced empirical process theory and the functional delta method in the Appendix. The additional variation is shown to be related to a conditional mean response, which can be estimated by a local averaging of the responses over the scores from the validation data. Alternatively, a subsampling method is shown to provide a valid confidence interval, without needing to estimate the conditional mean response. Numerical experiments are conducted to compare these different methods regarding the coverage probabilities and the lengths of the resulting confidence intervals.

Mostrar más [+]

Palabras clave de AGROVOC

data analysis equations models

Información bibliográfica

Publicado en

Journal of the American Statistical Association

Volumen 110 Edición 512 Paginación 1717 - 1725 ISSN 1537-274X

Editorial

Taylor & Francis

Otras materias

Confidence interval; %response; Validation data; Empirical process; Subsampling; Functional delta method

Idioma

Inglés

Tipo

Journal Article; Text

En AGRIS desde: 2024-02-28

Formato: MODS

Proveedor de Datos

Este registro bibliográfico ha sido proporcionado por National Agricultural Library

Descubra la colección de este proveedor de datos en AGRIS

Enlaces

DOI DOI https://dx.doi.org/10.1080/01621459.2014.993080

Buscar en Google Scholar

Si observa algún dato incorrecto en este registro bibliográfico, póngase en contacto con nosotros en [email protected]

AGRIS - Sistema Internacional para la Ciencia y Tecnología Agrícola

Share

On Asymptotic Distributions and Confidence Intervals for LIFT Measures in Data Mining

2015

Palabras clave de AGROVOC

Información bibliográfica