Runs the direct interactions or average pairwise distance test
on each of the collections and the core_modules.
- Returns:
- evaluation (dictionary) - a mapping of t -> k -> the network evaluation tuple of each collection (see network_tests.evaluate_collection() for details)