Course details
Computation Systems Architectures
AVS Acad. year 2024/2025 Winter semester 5 credits
The course covers architecture of modern computational systems composed of universal as well as special-purpose processors and their memory subsystems. Instruction-level parallelism is studied on scalar and superscalar processors. Then the processors with thread-level parallelism are discussed. Data parallelism is illustrated on SIMD streaming instructions and on graphical processors. Programming for shared memory systems in OpenMP follows and then the most proliferated multi-core multiprocessors and the advanced NUMA systems are described. Finally, the generic architecture of the graphics processing units and basic programming techniques using OpenMP are also covered. Techniques of low-power processors are also explained.
Guarantor
Course coordinator
Language of instruction
Completion
Time span
- 26 hrs lectures
- 12 hrs pc labs
- 14 hrs projects
Assessment points
- 60 pts final exam (written part)
- 10 pts mid-term test (written part)
- 30 pts projects
Department
Lecturer
Instructor
Jaroš Marta, Ing., Ph.D. (DCSY)
Kuník Oliver, Ing. (DCSY)
Olšák Ondřej, Ing. (DCSY)
Learning objectives
To familiarize yourself with the architecture of modern computational systems based on x86, ARM and RISC-V multicore processors in configurations with uniform (UMA) and non-uniform (NUMA) shared memory, often accompanied with a GPU accelerator. To understand hardware aspects of computational systems that have a significant impact on the application performance and power consumption. To be able to assess computing possibilities of a particular architecture and to predict the performance of applications. To clarify the role of a compiler and its cooperation with processors. To be able to orientate oneself on the computational system market, to evaluate and compare various systems.
Overview of the architecture of modern computational systems, their capabilities, limits and future trends. The ability to estimate performance of software applications on a given computer system, identify performance issues and propose their rectification. Practical user experience with supercomputers.
Understanding of hardware limitations having impact on the efficiency of software solutions.
Prerequisite knowledge and skills
Von-Neumann computer architecture, computer memory hierarchy, cache memories and their organization, programming in assembly and in C/C++, compiler's tasks and functions.
Study literature
- Hennessy, J.L., Patterson, D.A.: Computer Architecture - A Quantitative Approach. 5. vydání, Morgan Kaufman Publishers, Inc., 2012, 1136 s., ISBN 1-55860-596-7. download.
- Baer, J.L.: Microprocessor Architecture. Cambridge University Press, 2010, 367 s., ISBN 978-0-521-76992-1. info.
- van der Pas, R., Stotzer, E., and Terboven, T.: Using OpenMP-The Next Step, MIT Press Ltd, ISBN 9780262534789, 2017. info.
- Materiály ke kurzu Computer Science 152: Computer Architecture and Engineering. http://inst.eecs.berkeley.edu/~cs152/sp13/
- Agner Fog: Software optimization resources.
- Aktuální PPT prezentace přednášek v Elearningu.
Fundamental literature
- Baer, J.L.: Microprocessor Architecture. Cambridge University Press, 2010, 367 s., ISBN 978-0-521-76992-1.
- Hennessy, J.L., Patterson, D.A.: Computer Architecture - A Quantitative Approach. 5. vydání, Morgan Kaufman Publishers, Inc., 2012, 1136 s., ISBN 1-55860-596-7.
- van der Pas, R., Stotzer, E., and Terboven, T.: Using OpenMP-The Next Step, MIT Press Ltd, ISBN 9780262534789, 2017.
Syllabus of lectures
- Scalar processors, pipelined instruction processing and compiler assistance.
- Superscalar processors, dynamic instruction scheduling.
- Data flow through the hierarchy of cache memories.
- Branch prediction, optimization of instruction and data fetching.
- Processors with data level parallelism.
- Multi-threaded and multi-core processors.
- Loop parallelism and code vectorization.
- Functional parallelism and acceleration of recursive algorithms.
- Synchronization on systems with shared memory.
- Algorithm for cache coherency.
- Architectures with distributed shared memory.
- Architecture and programming of graphics processing units.
- Low power processors and techniques.
Syllabus of computer exercises
- Performance measurement of sequential code
- Cache blocking, loop swapping and unrolling
- OpenMP 4.0 vectorization
- OpenMP loops
- OpenMP tasks
- OpenMP sections and mutual exclusion
Syllabus - others, projects and individual work of students
- Performance evaluation and code optimization using OpenMP.
- Development of an application in OpenMP on a NUMA node.
Progress assessment
Assessment of two projects, 14 hours in total and, computer laboratories and a midterm examination.
- Missed labs can be substituted in alternative dates.
Schedule
Day | Type | Weeks | Room | Start | End | Capacity | Lect.grp | Groups | Info |
---|---|---|---|---|---|---|---|---|---|
Mon | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N104 | 14:00 | 15:50 | 20 | 1MIT 2MIT | xx | Jaroš |
Mon | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N105 | 14:00 | 15:50 | 20 | 1MIT 2MIT | xx | Chlebík |
Mon | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N104 | 16:00 | 17:50 | 20 | 1MIT 2MIT | xx | Jaroš |
Mon | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N105 | 16:00 | 17:50 | 20 | 1MIT 2MIT | xx | Chlebík |
Mon | comp.lab *) | lectures | N105 | 18:00 | 19:50 | 20 | 1MIT 2MIT | xx | |
Mon | comp.lab *) | 1., 2., 3., 4., 5., 6., 8., 9., 10., 11., 12., 13. of lectures | N104 | 18:00 | 19:50 | 20 | 1MIT 2MIT | xx | Termín nebude otevřen |
Tue | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N104 | 08:00 | 09:50 | 20 | 1MIT 2MIT | xx | Chlebík |
Tue | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N105 | 08:00 | 09:50 | 20 | 1MIT 2MIT | xx | Kuník |
Tue | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N104 | 12:00 | 13:50 | 20 | 1MIT 2MIT | xx | Chlebík |
Tue | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N105 | 12:00 | 13:50 | 20 | 1MIT 2MIT | xx | Kuník |
Wed | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N104 | 14:00 | 15:50 | 20 | 1MIT 2MIT | xx | Jaroš |
Wed | comp.lab | 4., 5., 6., 9., 10., 11. of lectures | N105 | 14:00 | 15:50 | 20 | 1MIT 2MIT | xx | Kuník |
Fri | lecture | lectures | E104 E105 E112 | 08:00 | 09:50 | 294 | 1MIT 2MIT | NBIO - NSPE NISD - NISY NSEC - NGRI NVER xx | Jaroš |
Course inclusion in study plans