Created attachment 5675 [details]
patch against Debian's binutils-220.127.116.1110327
GNU ld takes a lot of memory when linking the Glasgow Haskell Compiler (ghc), and many programs produced by this compiler. These programs have the characteristic that they consist of many, literally thousands, small object files with perhaps a dozen symbols each.
Version: GNU ld (GNU Binutils) 18.104.22.16810327
This topic comes up occasionally on haskell mailing lists, for example, http://firstname.lastname@example.org/msg18215.html
I have a patch (attached below) that reduces the memory usage for linking ghc from almost 440 MB to 190 MB on x86_64. I did not see any negative impact on performance, but admittedly I did not try very hard to measure it (I really expect no discernible impact: the extra work is limited to a couple of allocations and copying of memory, and ld does quite a bit of other, heavier lifting. The primary effect, namely the reduced working size of the program, can only help performance.) For ghc, link time improved very slightly from about 1.9 to 1.8 seconds.
The patch works by changing the default hashtable size of libbfd from 4k entries to 31 entries, and increasing the granularity of hash table sizes by adding more intermediate sizes. No code is changed at all. There is nothing magical about the 31 except that going below 31 did not seem to improve memory usage anymore.
Module name: src
Changes by: email@example.com 2011-06-03 16:16:32
bfd : ChangeLog hash.c
* hash.c (higher_primer_number): Add more, small, prime numbers.
Thanks for reporting this problem. I have applied the second half of your patch - adding more small prime values to the arrays in hash.c. But there is no need for the first part of the patch (changing the value of DEFAULT_SIZE). You can do this from the linker command line. vis:
You might also wish to consider adding the --reduce-memory-overheads option to the linker command line as well.