Calling add_default_extractors twice should be harmless, since the first set of extractors will match before the duplicates are ever tried.
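
A rough sketch of why the duplicate call should not matter, assuming extractors are tried in registration order and the first match wins. ExtractorRegistry, its match method, and the two suffix-based defaults are hypothetical stand-ins for illustration, not the real code behind add_default_extractors.

```python
class ExtractorRegistry:
    """Hypothetical registry; assumes first-match-wins lookup."""

    def __init__(self):
        # (predicate, name) pairs, tried in the order they were added.
        self.extractors = []

    def add_default_extractors(self):
        # Assumed defaults, chosen only for illustration.
        self.extractors.append((lambda s: s.endswith(".json"), "json"))
        self.extractors.append((lambda s: s.endswith(".xml"), "xml"))

    def match(self, filename):
        # Return the first extractor whose predicate accepts the input.
        # Duplicates added by a second call sit later in the list and
        # are never reached for inputs the first set already handles.
        for predicate, name in self.extractors:
            if predicate(filename):
                return name
        return None


once = ExtractorRegistry()
once.add_default_extractors()

twice = ExtractorRegistry()
twice.add_default_extractors()
twice.add_default_extractors()
```

Under these assumptions the second call only lengthens the list: lookups return the same result, and the only cost is a few extra failed checks for inputs that match nothing.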
Now they use an XML format instead of JSON.
We have the minsize test now.
These fluctuate regularly.