Results with new version are worse than those in the documentation

Running spear now gives results that differ slightly from those in the documentation (https://pythonhosted.org/bob.bio.spear/baselines.html), which reports:

DEV: EER = 1.89%
EVAL: HTER = 1.56%

I re-ran two baseline experiments:

./bin/verify.py -g 'demanding'  -d voxforge -p energy-2gauss -e mfcc-60 -a gmm-voxforge -s ubm_gmm_google_groups --groups {dev,eval}

./bin/verify_gmm.py -d voxforge -p energy-2gauss -e mfcc-60 -a gmm-voxforge -s ubm_gmm_sge --groups {dev,eval} -g 'demanding' -vvv

I got the following results:

DEV: EER = 1.981%
EVAL: HTER = 2.074%
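
To put the gap in numbers, here is a quick sketch in plain Python. It is not part of bob.bio.spear; the helper name `gap` is made up for illustration, and the figures are the ones quoted in this report.

```python
# Figures copied from this report, in percent.
documented = {"DEV EER": 1.89, "EVAL HTER": 1.56}
obtained = {"DEV EER": 1.981, "EVAL HTER": 2.074}


def gap(doc, new):
    """Return (absolute difference in percentage points, relative change in %)."""
    absolute = new - doc
    relative = 100.0 * absolute / doc
    return absolute, relative


for name in documented:
    absolute, relative = gap(documented[name], obtained[name])
    print(f"{name}: {absolute:+.3f} points ({relative:+.1f}% relative)")
```

By this count the DEV EER is about 0.09 points higher (roughly 5% relative), while the EVAL HTER is about 0.51 points higher (roughly 33% relative), so the regression is mostly on the eval set.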