See
http://pasky.or.cz/go/pachi-tr.pdf Fig. 4 (look only at the black line with circle points) for one diagram that shows Elo differences (100 Elo == one rank) in performance of the same software with just different number of playouts (simulations).
A rough approximation is that single doubling of the number of simulations should increase the strength of the program by single stone.
In practice, it's more complicated. First, this strength increase will not happen indefinitely as each program has some inherent strength limits related to systematic mishandling of various specific situations. So you will see a plateau at some point.
Second, each program has a unique mix of heuristics and some heuristics will improve strength only with small amount of situations (covering some glaring gaps that more simulations will solve easily) while other heuristics will kick in more the more simulations are available. So some programs will scare better and some will scare worse, and the differences are actually very big; designing a program that will work well on weak hardware and a program that will work well on a supercomputer are different tasks and a single program may not be good at both. (E.g. of the opensource programs, Fuego is better on older computers but Pachi would likely beat it on top-end computers.)