. We discuss parallel implementations of cg--methods for solving large scale systems in engineering applications. Furthermore we describe automatic optimizations for arbitrary clusters of workstations. We consider each cluster as a finite set of processor--memory pairs linked together with an interconnection network. They are modelled as a LogP machine, which is extended to use functions instead of constants for all important parameters. The optimizations guarantee runtimes within a small...