Definitions
Thread: The sequence of instructions carried out during the sequential, step-by-step execution of an application is called a thread of execution.
Concurrency Revolution: According to the authors, parallel programs are the ones that will enjoy significant performance improvements. This dramatically escalated advantage of parallel programs over sequential programs is termed the Concurrency Revolution.
Amdahl's Law: The speedup achievable through parallel programming is limited by the fraction of the program that can be parallelized. The remaining sequential portion bounds the overall speedup no matter how much the parallel portion is accelerated.
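As a rough illustration of Amdahl's Law, the sketch below evaluates the usual formula, speedup = 1 / ((1 - p) + p / s), where p is the parallelizable fraction of the run time and s is the speedup of that portion. The function name and the sample numbers are assumptions chosen for illustration; they do not come from the notes.

```python
def amdahl_speedup(parallel_fraction, parallel_speedup):
    """Overall speedup when only part of a program is accelerated.

    parallel_fraction: share of the original run time that can be parallelized (0..1)
    parallel_speedup:  how much faster that share runs in parallel
    """
    serial_fraction = 1.0 - parallel_fraction
    # New run time, relative to the original: the serial part is unchanged,
    # the parallel part shrinks by parallel_speedup.
    new_time = serial_fraction + parallel_fraction / parallel_speedup
    return 1.0 / new_time


# Hypothetical case: 90% of the run time is parallelizable.
print(amdahl_speedup(0.9, 10))    # parallel part runs 10x faster
print(amdahl_speedup(0.9, 1e12))  # parallel part is almost free
```

With 90% of the run time parallelizable, a 10x speedup of the parallel part gives only about 5.26x overall, and the overall speedup can never reach 10x because the 10% serial part remains.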
Transparent Scalability: The ability to execute the same application code on different hardware with different amounts of execution resources.
Occupancy: The ratio of the number of warps assigned to an SM to the maximum number it supports.
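The occupancy ratio can be worked out with simple arithmetic. The sketch below is a minimal example that assumes a warp size of 32 threads and a hypothetical SM limit of 64 resident warps; real limits depend on the device, and the function names are made up for illustration.

```python
import math

WARP_SIZE = 32  # assumed number of threads per warp


def resident_warps(blocks_on_sm, threads_per_block):
    """Number of warps assigned to an SM for a given block configuration."""
    # A block is split into warps; a partially filled warp still counts as one.
    warps_per_block = math.ceil(threads_per_block / WARP_SIZE)
    return blocks_on_sm * warps_per_block


def occupancy(warps_assigned, max_warps_per_sm):
    """Ratio of warps assigned to an SM to the maximum number it supports."""
    return warps_assigned / max_warps_per_sm


# Hypothetical SM: 3 blocks of 512 threads each, limit of 64 warps per SM.
warps = resident_warps(blocks_on_sm=3, threads_per_block=512)
print(warps, occupancy(warps, max_warps_per_sm=64))
```

In this assumed configuration 48 of 64 warp slots are filled, an occupancy of 0.75.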
Latency Tolerance or Latency Hiding: The mechanism of filling the latency of long-running operations in some threads with work from other threads.
Zero-Overhead Thread Scheduling: Selecting a warp that is ready for execution does not introduce any idle or wasted time into the execution timeline.
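Latency hiding and zero-overhead scheduling can be illustrated with a toy model. The sketch below is not how real hardware is implemented; it assumes that every instruction takes one cycle to issue, that the issuing warp then waits a fixed number of cycles, and that switching to another ready warp costs nothing. The function name and parameters are hypothetical.

```python
def issue_utilization(num_warps, latency, instructions_per_warp):
    """Fraction of cycles in which some warp issues an instruction (toy model)."""
    ready_at = [0] * num_warps                       # cycle at which each warp may issue again
    remaining = [instructions_per_warp] * num_warps  # instructions left per warp
    cycle = 0
    busy_cycles = 0
    while any(remaining):
        # Zero-overhead scheduling: picking a ready warp costs no extra cycles.
        for w in range(num_warps):
            if remaining[w] > 0 and ready_at[w] <= cycle:
                remaining[w] -= 1
                ready_at[w] = cycle + 1 + latency    # warp waits out its latency
                busy_cycles += 1
                break                                # at most one issue per cycle
        cycle += 1
    return busy_cycles / cycle


print(issue_utilization(num_warps=1, latency=3, instructions_per_warp=4))
print(issue_utilization(num_warps=4, latency=3, instructions_per_warp=4))
```

In this model a single warp leaves the issue slot idle while it waits, whereas four warps with a latency of three cycles keep it busy in every cycle, because another warp is always ready when one is waiting.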