- Joined
- May 2, 2013
- Messages
- 170 (0.04/day)
It has nothing to do with PhysX, this is because how different GCN and Maxwell GPU architectures are ... GCN has fantastic theoretical throughput and if the type of a problem/algorithm is suitable it really shines. There are some instances where it's less so (as you noticed with particle simulation) because scheduler can't reshuffle the instructions to keep all vector units inside GCN cores fully busy all the time.
Nvidia has more versatile architecture IMO although with less theoretical throughput. They did fantastic job refining it so that instruction scheduling is much simpler and they improved cache subsystem by introducing another level of caching.
That is incorrect. GCN is more suited for general puropose processing and that's because scheduling is almost entirely driver independent in Nvidia's Maxwell. Nvidia since Fermi has moved most of the scheduling to their drivers and according to them, scheduling takes up less then 4.5% of a Maxwell SM cluster area.
The reason the particle simulation test runs better on AMD is because it's a probably a port from the CUDA version of that test.
Last edited: