Add a heap high-water mark (HWM) to the Thread*, for compiled code.

Also fix a bug in thread detach, and implement the thread exit callback.

Destroy our pthread_mutex_t instances, and check for success. (This will
catch us deleting locked Mutex instances.)
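
For illustration, a minimal sketch of the destroy-and-check pattern. The
CheckedMutex class and its Fatal helper are hypothetical names, not the
Mutex class touched by this change. It assumes the platform reports a
locked mutex from pthread_mutex_destroy (commonly as EBUSY); POSIX leaves
that case undefined, so the check is best-effort.

```cpp
#include <pthread.h>

#include <cstdio>
#include <cstdlib>
#include <cstring>

// Hypothetical wrapper, for illustration only; not the real Mutex class.
class CheckedMutex {
 public:
  CheckedMutex() {
    int rc = pthread_mutex_init(&mutex_, nullptr);
    if (rc != 0) Fatal("pthread_mutex_init", rc);
  }

  ~CheckedMutex() {
    // pthread functions return the error number directly (they do not
    // set errno). Destroying a mutex that is still locked typically
    // fails here, which surfaces the bug instead of hiding it.
    int rc = pthread_mutex_destroy(&mutex_);
    if (rc != 0) Fatal("pthread_mutex_destroy", rc);
  }

  void Lock() {
    int rc = pthread_mutex_lock(&mutex_);
    if (rc != 0) Fatal("pthread_mutex_lock", rc);
  }

  void Unlock() {
    int rc = pthread_mutex_unlock(&mutex_);
    if (rc != 0) Fatal("pthread_mutex_unlock", rc);
  }

 private:
  // Report which call failed and why, then abort.
  static void Fatal(const char* what, int rc) {
    std::fprintf(stderr, "%s failed: %s\n", what, std::strerror(rc));
    std::abort();
  }

  pthread_mutex_t mutex_;
};
```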

Change-Id: I26cf8117b825234f6c790e0cf70b2c025a743f84
