Here's a small update (yes, already!)
I've optimized the graphics decoding so that it takes less temporary ram (6mb). The temporary ram
is now also allocated before the ram for everything except the graphics, so really there is only
about 2mb of extra ram used for graphics decoding (down from 24mb extra).
The driver now runs flawlessly on FBA-XXX.