-
Notifications
You must be signed in to change notification settings - Fork 175
Issues: modelscope/data-juicer
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
运行process时报错KeyError:"text_key"
question
Further information is requested
#469
by huangkaipeng4399
was closed Nov 1, 2024
3 tasks done
请问有没有提供 查看被算子筛选掉的数据 的功能
question
Further information is requested
#466
by huangkaipeng4399
was closed Oct 31, 2024
3 tasks done
[Bug]: Paper link error
bug
Something isn't working
documentation
Improvements or additions to documentation
#438
by ForeverNewLee
was closed Oct 14, 2024
3 tasks done
[Bug]: JupyterLab Official sample error
bug
Something isn't working
#437
by Night-Quiet
was closed Sep 26, 2024
3 tasks done
why often happen: One of the subprocesses has abruptly died during map operation?
question
Further information is requested
stale-issue
#430
by strongcc
was closed Oct 11, 2024
3 tasks done
执行 python tools/process_data.py --config train.yaml 命令
question
Further information is requested
#425
by abchbx
was closed Sep 10, 2024
3 tasks done
[Bug]: undefined symbol: _ZN3c104cuda9SetDeviceE
bug
Something isn't working
stale-issue
#419
by lh61500
was closed Oct 6, 2024
3 tasks done
[Feat] Data-Juicer as a Service
enhancement
New feature or request
stale-issue
#417
by drcege
was closed Sep 30, 2024
2 tasks done
[Feat] Enhance type hints and parameter validation
bug
Something isn't working
enhancement
New feature or request
#416
by drcege
was closed Sep 11, 2024
analyzer or analyzer?
question
Further information is requested
#409
by lilqz66
was closed Sep 3, 2024
3 tasks done
Heavy dependency of Data-Juicer
enhancement
New feature or request
#398
by BeachWang
was closed Sep 25, 2024
[Bug]: Loading checkpoint shards:的时候直接kill了是什么,是内存不够了吗
bug
Something isn't working
#388
by ZHJ19970917
was closed Aug 18, 2024
3 tasks done
[Bug]: 去重的hash计算卡在100%上,一直不过滤
bug
Something isn't working
priority:high
in high priority
#387
by xiafeng-nb
was closed Sep 10, 2024
3 tasks done
Confused with the meaning of 'preprocess' time-consuming in the
reproduced_redpajama /README.md
#383
by flyflypeng
was closed Aug 13, 2024
[Bug]: 运行sandbox的时候显示ModuleNotFoundError: No module named 'tools.mm_eval'
bug
Something isn't working
#379
by Snow0111
was closed Aug 20, 2024
3 tasks done
[Bug]: 使用图片相关算子在显存充足的情况下 报OOM
bug
Something isn't working
competition:BetterSynth
issues about Better Synth competition of Data-Juicer
#378
by tian969
was closed Aug 2, 2024
3 tasks done
[Feat] Automatically Handle New feature or request
stale-issue
BrokenPipeError
Caused by Limited Memory
enhancement
#377
by yxdyc
was closed Sep 29, 2024
2 tasks done
如何根据算子提前准备好需要资源?
question
Further information is requested
#376
by tian969
was closed Jul 30, 2024
3 tasks done
Efficient processing OPs for scanned images and pdf
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
stale-issue
#375
by yxdyc
was closed Sep 29, 2024
2 tasks done
[Bug]: librosa not work with np>1
bug
Something isn't working
dj:dist
issues/PRs about distributed data processing
dj:multimodal
issues/PRs about multimodal data processing
stale-issue
#372
by drcege
was closed Aug 23, 2024
[Bug]: MODEL_ZOO is not reused in subprocesses
enhancement
New feature or request
#370
by drcege
was closed Aug 21, 2024
[Bug]: Memory leak in video OP
bug
Something isn't working
#369
by BeachWang
was closed Aug 1, 2024
3 tasks done
Previous Next
ProTip!
Updated in the last three days: updated:>2024-11-07.