Last updated: 2018-11-14

workflowr checks: (Click a bullet for more information)
  • R Markdown file: up-to-date

    Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

  • Repository version: 50219af

    Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility. The version displayed above was the version of the Git repository at the time these results were generated.

    Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
    
    Ignored files:
        Ignored:    .DS_Store
        Ignored:    .Rhistory
        Ignored:    .Rproj.user/
        Ignored:    data/.DS_Store
    
    Untracked files:
        Untracked:  analysis/chipexoeg.Rmd
        Untracked:  analysis/efsd.Rmd
        Untracked:  analysis/talk1011.Rmd
        Untracked:  data/chipexo_examples/
        Untracked:  data/chipseq_examples/
        Untracked:  talk.Rmd
        Untracked:  talk.pdf
    
    Unstaged changes:
        Modified:   analysis/fda.Rmd
        Modified:   analysis/sigma.Rmd
    
    
    Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
Expand here to see past versions:
    File Version Author Date Message
    Rmd 50219af Dongyue Xie 2018-11-14 wflow_publish(“analysis/literature.Rmd”)
    html 2642d62 Dongyue Xie 2018-11-14 Build site.
    Rmd 7771721 Dongyue Xie 2018-11-14 wflow_publish(“analysis/literature.Rmd”)
    html 99b3deb Dongyue Xie 2018-10-13 Build site.
    Rmd 44e7e44 Dongyue Xie 2018-10-13 add tf
    html a21393a Dongyue Xie 2018-10-05 Build site.


This is a broad list of literature on smoothing Gaussian/non-Gaussian data.

Nonparametric methods

  1. KNN

  2. Kernel smootihng methods: Chapter 6 of ESL.

  3. Local regression(linear & higher order): Loader, C. (2006). Local regression and likelihood. Springer Science & Business Media.

Note: Local regression makes no global assumptions about the function but assume that locally it can be well approximated with a member of a simple class of parametric function. Only observations in certain window are used.

  1. Splines: regression splines, smoothing splines; More generally, reproducing kernel Hilbert space: Chapter 5 of ESL

  2. Locally adaptive estimators: wavelet( Mallat, S. (1999). A wavelet tour of signal processing. Elsevier ), Locally adaptive regression splines(A varaiant of smooting splines achieves better local adaptivity. Mammen, E., & van de Geer, S. (1997). Locally adaptive regression splines. The Annals of Statistics, 25(1), 387-413.), Trend filtering( Kim, S. J., Koh, K., Boyd, S., & Gorinevsky, D. (2009). l_1 Trend Filtering. SIAM review, 51(2), 339-360. ).

  3. Additive models: Sparse additive models, Generalized additive mixed models.

More on trend filtering and additive models:

Trend filtering:

Wang, Y. X., Sharpnack, J., Smola, A. J., & Tibshirani, R. J. (2016). Trend filtering on graphs. The Journal of Machine Learning Research, 17(1), 3651-3691.

Ramdas, A., & Tibshirani, R. J. (2016). Fast and flexible ADMM algorithms for trend filtering. Journal of Computational and Graphical Statistics, 25(3), 839-858. The R package for this algo is glmgen.

Tibshirani, R. J. (2014). Adaptive piecewise polynomial estimation via trend filtering. The Annals of Statistics, 42(1), 285-323.

Additive models:

Sadhanala, V., & Tibshirani, R. J. (2017). Additive Models with Trend Filtering. arXiv preprint arXiv:1702.05037.

Petersen, A., Witten, D., & Simon, N. (2016). Fused lasso additive model. Journal of Computational and Graphical Statistics, 25(4), 1005-1025.

Yin Lou, Jacob Bien, Rich Caruana & Johannes Gehrke (2016) Sparse Partially Linear Additive Models, Journal of Computational and Graphical Statistics, 25:4, 1126-1140

Generalized Sparse Additive Models

Chouldechova, A., & Hastie, T. (2015). Generalized additive model selection. arXiv preprint arXiv:1506.03850.

Binomial

Marchand, P., & Marmet, L. (1983). Binomial smoothing filter: A way to avoid some pitfalls of least‐squares polynomial smoothing. Review of scientific instruments, 54(8), 1034-1041.

Hansen, K. D., Langmead, B., & Irizarry, R. A. (2012). BSmooth: from whole genome bisulfite sequencing reads to differentially methylated regions. Genome biology, 13(10), R83.

Note: It actually uses local likelihood moother and assumes \(logit(\pi)\) is approximated by a second degree polynomial. They assume that data follow a binomial distribution and the parameters defining the polynomial are estimated by fitting a weighted generalized linear model to the data inside the genomic window.


This reproducible R Markdown analysis was created with workflowr 1.1.1