SOLUTION MANUAL
Multicore & GPU Programming: An Integrated Approach, 2e
Gerassimos Barlas
Contents
1 Introduction
2 Multicore and Parallel Program Design
3 Threads and Concurrency in standard C++
4 Parallel data structures
5 Distributed memory programming
6 GPU Programming
7 GPU and Accelerator Programming: OpenCL
8 Shared-memory programming: OpenMP
9 The Thrust Template Library
10 High-level multi-threaded programming with the Qt library
11 Load Balancing
Chapter 1
Introduction
Exercises
1. Study one of the top 10 most powerful supercomputers in the world. Discover:
What kind of operating system does it run?
How many CPUs/GPUs is it made of?
What is its total memory capacity?
What kind of software tools can be used to program it?
Answer
Students should research the answer by visiting the Top 500 site and, if available, the site of one of the reported systems.
2. How many cores are inside the top GPU offerings from NVidia and AMD?
What is the GFlop rating of these chips?
Answer N/A.
3. The performance of the most powerful supercomputers in the world is usually reported as two numbers, Rpeak and Rmax, both in TFlops (tera floating-point operations per second). Why is this done? What are the factors reducing performance from Rpeak to Rmax? Would it be possible to ever achieve Rpeak?
Answer
This is done because the peak performance is unattainable. Sustained, measured performance on specific benchmarks is a better indicator of the true machine potential.
The main factor reducing Rmax below Rpeak is communication overhead.
Rpeak and Rmax could never be equal. Extremely compute-heavy applications that have no inter-node communication could asymptotically approach Rpeak if they were to run for a very long time; a very long execution time is required to diminish the influence of the start-up costs.
4. A sequential application with a 20% part that must be executed sequentially is required to be accelerated five-fold. How many CPUs are required for this task?
Answer
This requires the application of Amdahl’s law. The part that can be parallelized is $\alpha = 1 - 20\% = 80\%$. The speedup predicted by Amdahl’s law is
\[ speedup = \frac{1}{1 - \alpha + \frac{\alpha}{N}} \]
For comparison, achieving a three-fold speedup requires that:
\[ \frac{1}{1 - \alpha + \frac{\alpha}{N}} = 3 \Rightarrow \frac{1}{0.2 + \frac{0.8}{N}} = 3 \Rightarrow \frac{0.8}{N} = \frac{1}{3} - 0.2 \Rightarrow N = \frac{0.8}{\frac{1}{3} - 0.2} = 6 \quad (1.1) \]
Achieving a five-fold speedup requires that:
\[ \frac{1}{0.2 + \frac{0.8}{N}} = 5 \Rightarrow N = \frac{0.8}{\frac{1}{5} - 0.2} = \frac{0.8}{0} = \infty \quad (1.2) \]
So, it is impossible to achieve a five-fold speedup, according to Amdahl’s law.
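The same calculation can be checked programmatically. The following is a minimal C++ sketch, not taken from the book; the helper name coresForSpeedup and the round-off guard are illustrative assumptions. It solves Amdahl’s law for the smallest N that reaches a target speedup and reports when the target exceeds the asymptotic limit 1/(1 − α).

#include <cmath>
#include <iostream>
#include <optional>

// Amdahl's law: speedup(N) = 1 / ((1 - a) + a / N); it is bounded above by 1 / (1 - a).
// Returns the smallest N reaching `target`, or no value if the target is unattainable.
std::optional<int> coresForSpeedup(double a, double target)
{
    double denom = 1.0 / target - (1.0 - a);   // equals a / N when speedup == target
    if (denom <= 1e-12)                        // target at or beyond the 1/(1-a) limit
        return std::nullopt;
    return static_cast<int>(std::ceil(a / denom - 1e-9));   // guard against round-off
}

int main()
{
    const double a = 0.8;   // 80% of the application can be parallelized
    if (auto n = coresForSpeedup(a, 3.0))
        std::cout << "Three-fold speedup needs N = " << *n << " CPUs\n";   // prints 6
    if (!coresForSpeedup(a, 5.0))
        std::cout << "Five-fold speedup is unattainable (limit is 1/(1-a) = 5)\n";
    return 0;
}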
5. A parallel application running on 5 identical machines has a 10% sequential part. What is the speedup relative to a sequential execution on one of the machines? If we would like to double that speedup, how many CPUs would be required?
Answer
This requires the application of Gustafson-Barsis’ law, as the information relates to a parallel application. The parallel part is $\alpha = 1 - 10\% = 90\%$. The speedup over a single machine is
\[ speedup = (1 - \alpha) + N \cdot \alpha = 0.1 + 5 \cdot 0.9 = 4.6 \]
Doubling the speedup would require
\[ 0.1 + N \cdot 0.9 = 9.2 \Rightarrow N = \frac{9.1}{0.9} \approx 10.1 \]
machines. As N has to be an integer, we have to round up to the closest integer, i.e. $N = 11$.
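A short C++ sketch of the same arithmetic follows; it is not part of the book’s code, and the helper names gustafsonSpeedup and machinesFor are illustrative. It evaluates the Gustafson-Barsis scaled speedup and rounds up the machine count needed to double it.

#include <cmath>
#include <iostream>

// Gustafson-Barsis' law: speedup(N) = (1 - a) + N * a, with a the parallel fraction
// measured on the parallel run.
double gustafsonSpeedup(double a, int N)
{
    return (1.0 - a) + N * a;
}

// Machines needed to reach a target scaled speedup, rounded up to an integer count.
int machinesFor(double a, double target)
{
    return static_cast<int>(std::ceil((target - (1.0 - a)) / a - 1e-9));
}

int main()
{
    const double a = 0.9;                          // 90% parallel part
    double s5 = gustafsonSpeedup(a, 5);            // 0.1 + 5 * 0.9 = 4.6
    std::cout << "Speedup on 5 machines: " << s5 << '\n';
    std::cout << "Machines needed to double it: " << machinesFor(a, 2 * s5) << '\n';   // 11
    return 0;
}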
6. An application with a 5% non-parallelizable part is to be modified for parallel execution. Currently on the market there are two parallel machines available: machine X with 4 CPUs, each CPU capable of executing the application in 1hr on its own, and machine Y with 16 CPUs, each CPU capable of executing the application in 2hr on its own. Which machine should you buy if the minimum execution time is required?
Answer
As the information provided relates to a sequential application, we have to apply Amdahl’s law. The execution time on machine X is:
\[ t_X = (1 - \alpha)T + \frac{\alpha T}{N} = 0.05 \cdot 1hr + \frac{0.95 \cdot 1hr}{4} = 0.2875hr \quad (1.3) \]
The execution time on machine Y is:
\[ t_Y = (1 - \alpha)T + \frac{\alpha T}{N} = 0.05 \cdot 2hr + \frac{0.95 \cdot 2hr}{16} = 0.21875hr \quad (1.4) \]
Since $t_Y < t_X$, machine Y is the one to buy.
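The comparison can also be scripted. Below is a minimal C++ sketch, not from the book, with the illustrative helper parallelTime; it evaluates the Amdahl execution-time model t(N) = (1 − α)T + αT/N for both machines and picks the faster one.

#include <iostream>

// Parallel execution time under Amdahl's assumptions:
// t(N) = (1 - a) * T + a * T / N, where T is the single-CPU time on that machine.
double parallelTime(double a, double T, int N)
{
    return (1.0 - a) * T + a * T / N;
}

int main()
{
    const double a = 0.95;                      // 95% of the application parallelizes
    double tX = parallelTime(a, 1.0, 4);        // machine X: T = 1 hr, 4 CPUs
    double tY = parallelTime(a, 2.0, 16);       // machine Y: T = 2 hr, 16 CPUs
    std::cout << "tX = " << tX << " hr, tY = " << tY << " hr\n";   // 0.2875 vs 0.21875
    std::cout << (tY < tX ? "Machine Y is the better buy\n" : "Machine X is the better buy\n");
    return 0;
}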