* fixed SynthBenchmark for slow GPUs
* try with increasing amounts of work
* additionally track total work time
* made WorkScale a float
* removed BenchMarkGraph (was for internal testing)
* updated version to 0.95
[CL 2291452 by Martin Mittring in Main branch]