How does SRAM compare to regular data caches in terms of transfer speed and typical capacity/cost requirements?
I believe SRAM is the actual memory used in most CPU caches: https://en.wikipedia.org/wiki/Static_random-access_memory#In_computers.
Interestingly, some CPUs (mostly in game consoles) use eDRAM (https://en.wikipedia.org/wiki/EDRAM), which is DRAM integrated onto the same die or module as the processor. It gives better density than SRAM, and higher speeds than off-chip DRAM, at the cost of additional fabrication steps.
@gmukobi do you know to what extent the program is "baked into the hardware"? As in, does the custom control unit essentially store a dump of the operations it needs to perform, or are the operations stored elsewhere in the chip and the custom control unit knows how to retrieve them?
The special-purpose processor (accelerator) illustrated in the right-hand diagram has no instruction selection or fetch/decode modules. This is an example of how hardware systems often trade flexibility and ease of programming for performance (e.g. the picture on the previous slide).
@sanjayen I was also wondering how the instructions are hardcoded such that you don't need the fetch/decode and arbitrary-instruction-execution overhead.
Typically, specialized hardware has no threading (so no atomic operations, barriers, etc.).
From lecture: One important thing the special purpose accelerator doesn't have that the general processor does is instruction selection/fetching/decoding hardware. That's because a fully completed hardware accelerator (at least as shown here) is incapable of running arbitrary instructions from memory--the program that it will run is baked into the hardware itself through the custom control unit shown here and other mechanisms.
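As a rough software analogy for the contrast described in lecture (not a model of any real chip), the sketch below compares a toy "general-purpose" loop that fetches and decodes instructions from a program with a "fixed-function" routine whose computation is hardwired. The opcode names, the `PROGRAM` list, and both function names are hypothetical and chosen only for illustration.

```python
# Toy analogy only: all names and opcodes here are hypothetical.

def run_general_purpose(program, x):
    """Programmable core: fetch, decode, and execute each instruction."""
    acc = x
    pc = 0  # program counter
    while pc < len(program):
        opcode, operand = program[pc]  # fetch
        if opcode == "mul":            # decode, then execute
            acc *= operand
        elif opcode == "add":
            acc += operand
        else:
            raise ValueError(f"unknown opcode: {opcode}")
        pc += 1
    return acc


def run_fixed_function(x):
    """Accelerator analogue: the computation 3*x + 1 is baked in.

    There is no program to fetch and no opcode to decode, so it can
    compute nothing else.
    """
    return 3 * x + 1


# The same computation expressed as a program for the general-purpose loop.
PROGRAM = [("mul", 3), ("add", 1)]
```

Both `run_general_purpose(PROGRAM, x)` and `run_fixed_function(x)` compute the same value for any `x`, but only the first can be handed a different program. The second avoids the per-instruction fetch/decode work entirely, which is the flexibility-for-efficiency trade described above.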