-
Notifications
You must be signed in to change notification settings - Fork 162
Issues: modelscope/data-juicer
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
why often happen: One of the subprocesses has abruptly died during map operation?
question
Further information is requested
#430
opened Sep 14, 2024 by
strongcc
3 tasks done
[Bug]: undefined symbol: _ZN3c104cuda9SetDeviceE
bug
Something isn't working
#419
opened Sep 7, 2024 by
lh61500
3 tasks done
[Feat] Data-Juicer as a Service
enhancement
New feature or request
#417
opened Sep 5, 2024 by
drcege
2 tasks done
[Feat] Support New feature or request
dj_batched_group_ops
that allows for the configuration and application of multiple operators in smaller, manageable batches
enhancement
#413
opened Sep 2, 2024 by
yxdyc
2 tasks done
[Feat] Support New feature or request
PythonCodesOperator
and BashCodesOperator
that wraps an existing python file, or some code snippets to be executed, such as the existing DJ tools.
enhancement
#412
opened Sep 2, 2024 by
yxdyc
2 tasks done
Guidance for OP with multiple data fields to be processed
enhancement
New feature or request
#411
opened Sep 2, 2024 by
yxdyc
2 tasks done
Heavy dependency of Data-Juicer
enhancement
New feature or request
#398
opened Aug 22, 2024 by
BeachWang
[Feat] Automatically Handle New feature or request
BrokenPipeError
Caused by Limited Memory
enhancement
#377
opened Jul 30, 2024 by
yxdyc
2 tasks done
Efficient processing OPs for scanned images and pdf
dj:multimodal
issues/PRs about multimodal data processing
dj:op
issues/PRs about some specific OPs
enhancement
New feature or request
#375
opened Jul 30, 2024 by
yxdyc
2 tasks done
[Feat]: Add Ray actor support
dj:dist
issues/PRs about distributed data processing
enhancement
New feature or request
#371
opened Jul 29, 2024 by
drcege
ProTip!
What’s not been updated in a month: updated:<2024-08-20.