Home
See research for previous writing
- Front Matter
- Title
- Abstract
- ToC, LoF, LoT
- Introduction
- Broad introduction to topic
- Problem Statement
- Proposal
- Project Overview
- Means of development
- Emphasise developer level
- Match to what has been produced (work backwards a little)
- Beneficiaries of this research
- History & context
- Literature Review
- Overview: review method, limitations, scope
- Literature, grouped by similarities
- Packages
- Algorithms
- Brief theory chapter
- Motivating the following chapter’s aspects, objectives which will be
linked in the next chapter to each aspect
- Defining a distributed statistical modelling system for R
- What does it mean to be distributed?
- Properties that distributed object/chunk/computation etc. must
possess
- What can’t be done
- Direction: Statistical Modelling
- High-level (like mathematical definition)
- Response
- Illustrative Problem
- Aspect: Object System
- Aspect: Computation
- Aspect: Concurrency
- Aspect: Reference
- Experiments in implementation of System
- Motivating the following outline of eventual implementation (Tying
in with aspects from previous chapter)
- Bring over thoughts on implementations of concurrency, carried over
from the concurrency aspect section in the previous chapter
- RServe-based System - Current and Proposed
Information Structures - Experiment: Eager Distributed
Object Supplement - Experiment: Eager Distributed
Object Precursory Report - Experiment: Distributed Decision
Tree
- Redis-based System - Initial
Distributed Object Experimentation with a Message Queue Communication
System - Report on Current Chunk
Architecture - Chunk ID Origination and
Client-Server Communication - Message Queues for Communication in a
Distributed Object System - Initial Chunk Experimentation with a
Message Queue Object System - Inter-node communication with
Redis
- DistObj System - distObj System
Initialisation and Input - DistObje Non-Assigned Data Return -
Description of distObj Client-Server Call
Process
- Outline of System Implementation
- Overview
- orcv
- chunknet
- largescaler
- largescalemodelr
- System Capabilities
- Application
- ADMM
- lm, gml, admm, xgboost, boosting
- What is more efficient, what is less (data movement)
- Theoretical comparison with other Systems
- Benchmarks
- e.g. Against Spark table, foreach etc.
- Validity and justification of benchmarks
- Extensions to the system
- e.g. shuffle, index as in-system extensions
- Open/Closed principle
- Extensions serving to validate the system
- Recommendations
- Comparison between expectation and reality
- Future Work
- Appendices
- Documentation
- Source Code
- Bibliography