Course Outline
Introduction to Programming Big Data with R (pbdR)
- Configuring your environment for pbdR
- Overview of functionalities and tools provided by pbdR
- Packages frequently used alongside pbdR for Big Data tasks
Message Passing Interface (MPI)
- Utilizing pbdR MPI
- Implementing parallel processing techniques
- Managing point-to-point communication
- Handling matrix operations
- Performing matrix summation
- Executing collective communication patterns
- Aggregating matrices using the Reduce function
- Implementing Scatter and Gather operations
- Exploring other MPI communication methods
Distributed Matrices
- Constructing a distributed diagonal matrix
- Performing Singular Value Decomposition (SVD) on distributed matrices
- Building distributed matrices in parallel
Applications in Statistics
- Applying Monte Carlo Integration
- Importing Datasets
- Accessing data across all processes
- Broadcasting data from a single process
- Reading partitioned datasets
- Executing Distributed Regression
- Implementing Distributed Bootstrap methods
21 Hours
Testimonials (2)
The subject matter and the pace were perfect.
Tim - Ottawa Research and Development Center, Science Technology Branch, Agriculture and Agri-Food Canada
Course - Programming with Big Data in R
Michael, the trainer, is very knowledgeable and skillful in the subject of Big Data and R. He is very flexible and quickly customizes the training to meet clients' needs. He is also very capable of solving technical and subject matter problems on the go. Fantastic and professional training!