Analysis of missing data

Main page (in Portuguese)

“The most pressing task, in my opinion, is placing further emphasis on the general recognition and understanding, at a conceptual level, of the necessity of properly dealing with the missing-data mechanism, as part of our ongoing emphasis on the importance of the data collection process in any meaningful statistical analysis. The missing-data mechanism is in the blood of statistics, and it is the nastiest and the most deceptive cell, especially for nonstatisticians - why on earth should anyone be concerned with data that one does not even have?” Meng (2000)

“(Colonel Ross) Is there any other point to which you would wish to draw my attention?
(Holmes) To the curious incident of the dog in the night-time.
(Ross) The dog did nothing in the night-time.
  That was the curious incident!, remarked Sherlock Holmes.” Dawid and Dickey (1977)


Software

R package: ACD (named Catdata until 2011, and referred to by that name in the manuscripts below).
Performs analysis of categorical data with missing (or complete) responses via product-multinomial distributions.
Source code. R package.

Manuscripts

Poleto, F. Z., Paulino, C. D., Singer, J. M. and Molenberghs, G. (2014). Semi-parametric Bayesian analysis of binary responses with a continuous covariate subject to non-random missingness. To appear in Statistical Modelling.

Poleto, F. Z., Singer, J. M. and Paulino, C. D. (2014). A product-multinomial framework for categorical data analysis with missing responses. Brazilian Journal of Probability and Statistics 28, 109-139. doi: 10.1214/12-BJPS198. R code to reproduce the analyses of the manuscript.

Poleto, F. Z., Molenberghs, G., Paulino, C. D. and Singer, J. M. (2011). Sensitivity analysis for incomplete continuous data. TEST 20, 589-606. doi: 10.1007/s11749-010-0219-x.

Poleto, F. Z., Paulino, C. D., Molenberghs, G. and Singer, J. M. (2011). Inferential implications of over-parameterization: a case study in incomplete categorical data. International Statistical Review 79, 92-113. doi: 10.1111/j.1751-5823.2011.00130.x.

Poleto, F. Z. (2011). Análise de dados categorizados com omissão em variáveis explicativas e respostas (Analysis of categorical data with missingness in explanatory and response variables, in Portuguese). Ph.D. thesis. Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil.

Poleto, F. Z., Singer, J. M. and Paulino, C. D. (2011). Comparing diagnostic tests with missing data. Journal of Applied Statistics 38, 1207-1222. doi: 10.1080/02664763.2010.491860. R code to reproduce the analyses of the manuscript.

Poleto, F. Z., Singer, J. M. and Paulino, C. D. (2011). Missing data mechanisms and their implications on the analysis of categorical data. Statistics and Computing 21, 31-43. doi: 10.1007/s11222-009-9143-x.

Singer, J. M., Poleto, F. Z. and Paulino, C. D. (2007). Catdata: software for analysis of categorical data with complete or missing responses. Actas de la XII Reunión Científica del Grupo Argentino de Biometría y I Encuentro Argentino-Chileno de Biometría.

Poleto, F. Z. (2007). Comandos (em R) para reproduzir as análises de exemplos do livro Análise de Dados Categorizados de Paulino e Singer (2006) [Commands (in R) to reproduce the analyses of the examples in the book Analysis of Categorical Data by Paulino and Singer (2006), in Portuguese]. Unpublished manuscript. R code to reproduce the analyses of the manuscript.

Poleto, F. Z., Singer, J. M. and Paulino, C. D. (2007). Analyzing categorical data with complete or missing responses using the Catdata package. Unpublished vignette for the R package. R code to reproduce the analyses of the manuscript.

Poleto, F. Z., Singer, J. M. and Paulino, C. D. (2007). A product-multinomial framework for categorical data analysis with missing responses. Technical report RT-MAE-2007-07. Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil.

Poleto, F. Z. (2006). Análise de dados categorizados com omissão (Analysis of categorical data with missingness, in Portuguese). M.Sc. dissertation, corrected version. Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil. R code to reproduce the analyses of the manuscript: Ex. 1, Ex. 2 - part 1, Ex. 2 - part 2, Ex. 3, Ex. 4 and Ex. 5.

 

References

Dawid, A. P. and Dickey, J. M. (1977). Likelihood and Bayesian inference from selectively reported data. Journal of the American Statistical Association 72, 845-850.

Meng, X.-L. (2000). Missing data: dial M for ???. Journal of the American Statistical Association 95, 1325-1330.

Paulino, C. D. and Singer, J. M. (2006). Análise de dados categorizados (Analysis of categorical data, in Portuguese). São Paulo: Edgard Blücher.

Frederico Zanqueta Poleto's home page <frederico@poleto.com>. Last modified: March 4, 2014.