Commit 1b84b956 authored by Tiago de Freitas Pereira's avatar Tiago de Freitas Pereira
Browse files

These operations should be delayed to avoid larger than memory issues

parent ffa826bb
Pipeline #52141 passed with stage
in 16 minutes and 31 seconds
...@@ -165,7 +165,7 @@ def get_split_dataframe(filename): ...@@ -165,7 +165,7 @@ def get_split_dataframe(filename):
genuines = df[df.probe_subject_id == df.bio_ref_subject_id] genuines = df[df.probe_subject_id == df.bio_ref_subject_id]
impostors = df[df.probe_subject_id != df.bio_ref_subject_id] impostors = df[df.probe_subject_id != df.bio_ref_subject_id]
return impostors.compute(), genuines.compute() return impostors, genuines
def split_csv_writer(filename): def split_csv_writer(filename):
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment