Home

A Platform for Large-scale Statistical Modelling using R

Jason Cairns

2021-05-18

1 Introduction

Package:            largeScaleR
Type:               Package
Title:              Provides a Distributed Framework 
                    for Statistical Modelling
Version:            0.4

2 Motivation

3 Specifications

4 Local Approaches

4.1 Using R

5 disk.frame

File-backed dataframes

6 Distributed Approaches

6.1 Outside of R

7 MapReduce with Hadoop

8 Distributed Approaches

8.1 Using R

9 SNOW

Split list and map over multiple processes

10 Preliminary Results


11 Preliminary Results in Detail


12 Preliminary Results in Detail

13 Main Demonstration

14 Challenges

15 Further Work

16 Contact

GitHub