Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in
  • bob.learn.tensorflow bob.learn.tensorflow
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 11
    • Issues 11
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • bobbob
  • bob.learn.tensorflowbob.learn.tensorflow
  • Issues
  • #58
Closed
Open
Issue created Jul 02, 2018 by Saeed SARFJOO@ssarfjooDeveloper

list(set(CLIENT_IDS)) is not reproducible

The code for taking client_ids from files is not reproducible. Specially, if you want to run two separate tasks for train and evaluation this is serious issue.

For example running the following code will produce different list each time:

import bob.db.voxforge
db = bob.db.voxforge.Database()
files = db.objects()
CLIENT_IDS = (str(f.client_id) for f in files)
CLIENT_IDS = list(set(CLIENT_IDS))
for id in CLIENT_IDS:
    print(id)

Simple solution can be sorting the CLIENT_IDS with CLIENT_IDS.sort()

To upload designs, you'll need to enable LFS and have an admin enable hashed storage. More information
Assignee
Assign to
Time tracking