Issues: modelscope/data-juicer

Issues list

KeyError: "text_key" raised when running process question Further information is requested
#469 by huangkaipeng4399 was closed Nov 1, 2024
3 tasks done
Is there a feature for viewing the data filtered out by operators? question Further information is requested
#466 by huangkaipeng4399 was closed Oct 31, 2024
3 tasks done
[Bug]: Paper link error bug Something isn't working documentation Improvements or additions to documentation
#438 by ForeverNewLee was closed Oct 14, 2024
3 tasks done
[Bug]: JupyterLab Official sample error bug Something isn't working
#437 by Night-Quiet was closed Sep 26, 2024
3 tasks done
why often happen: One of the subprocesses has abruptly died during map operation? question Further information is requested stale-issue
#430 by strongcc was closed Oct 11, 2024
3 tasks done
Running the command python tools/process_data.py --config train.yaml question Further information is requested
#425 by abchbx was closed Sep 10, 2024
3 tasks done
AssertionError bug Something isn't working question Further information is requested
#420 by abchbx was closed Sep 9, 2024
3 tasks done
[Bug]: undefined symbol: _ZN3c104cuda9SetDeviceE bug Something isn't working stale-issue
#419 by lh61500 was closed Oct 6, 2024
3 tasks done
[Feat] Data-Juicer as a Service enhancement New feature or request stale-issue
#417 by drcege was closed Sep 30, 2024
2 tasks done
[Feat] Enhance type hints and parameter validation bug Something isn't working enhancement New feature or request
#416 by drcege was closed Sep 11, 2024
analyzer or analyzer? question Further information is requested
#409 by lilqz66 was closed Sep 3, 2024
3 tasks done
Heavy dependency of Data-Juicer enhancement New feature or request
#398 by BeachWang was closed Sep 25, 2024
[Bug]: Deduplication hash computation hangs at 100% and never proceeds to filtering bug Something isn't working priority:high in high priority
#387 by xiafeng-nb was closed Sep 10, 2024
3 tasks done
[Bug]: Image-related operators report OOM even when GPU memory is sufficient bug Something isn't working competition:BetterSynth issues about Better Synth competition of Data-Juicer
#378 by tian969 was closed Aug 2, 2024
3 tasks done
[Feat] Automatically Handle BrokenPipeError Caused by Limited Memory enhancement New feature or request stale-issue
#377 by yxdyc was closed Sep 29, 2024
2 tasks done
How to prepare the required resources in advance based on the operators used? question Further information is requested
#376 by tian969 was closed Jul 30, 2024
3 tasks done
Efficient processing OPs for scanned images and pdf dj:multimodal issues/PRs about multimodal data processing dj:op issues/PRs about some specific OPs enhancement New feature or request stale-issue
#375 by yxdyc was closed Sep 29, 2024
2 tasks done
[Bug]: librosa not work with np>1 bug Something isn't working dj:dist issues/PRs about distributed data processing dj:multimodal issues/PRs about multimodal data processing stale-issue
#372 by drcege was closed Aug 23, 2024
[Bug]: MODEL_ZOO is not reused in subprocesses enhancement New feature or request
#370 by drcege was closed Aug 21, 2024
[Bug]: Memory leak in video OP bug Something isn't working
#369 by BeachWang was closed Aug 1, 2024
3 tasks done