Comparing distributions: ℓ1 geometry improves kernel two-sample testing