From what I can gather, there is not much theory behind local search methods like Tabu Search, Simulated Annealing, Large Neighborhood Search, etc. By theory I mean some mathematical argument as to why these methods work in the cases that they do.
Despite that I feel there might exist some good theories about such methods that I would be interested in.
If someone knows of a good set of such references (maybe survey papers for instance), I would be grateful.