Initial commit
7
skills/dataset-splitter/references/README.md
Normal file
@@ -0,0 +1,7 @@
# References

Bundled resources for the dataset-splitter skill.

- [ ] dataset_splitting_best_practices.md: Best practices for dataset splitting, including stratified sampling, handling imbalanced datasets, and avoiding data leakage.
- [ ] sklearn_train_test_split_docs.md: Excerpt from the scikit-learn documentation on the train_test_split function, detailing its parameters and usage.
- [ ] common_dataset_formats.md: Documentation on common dataset formats (CSV, Parquet, etc.) and how to handle them.