gh-jeremylongshore-claude-c…/skills/dataset-splitter/references/README.md
2025-11-29 18:51:12 +08:00

References

Bundled resources for the dataset-splitter skill:

  • dataset_splitting_best_practices.md: Best practices for dataset splitting, covering stratified sampling, handling imbalanced datasets, and avoiding data leakage.
  • sklearn_train_test_split_docs.md: Excerpt from the scikit-learn documentation for the train_test_split function, detailing its parameters and usage.
  • common_dataset_formats.md: Documentation on common dataset formats (CSV, Parquet, etc.) and how to handle them.
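
The stratified sampling mentioned in the resources above can be illustrated with a minimal, standard-library-only sketch. It is not taken from the bundled documents: the function name `stratified_split`, its parameters, and the toy labels are assumptions made for illustration. The idea is to shuffle and split each class separately so that an imbalanced label distribution is preserved in both parts, and no index lands in both parts.

```python
import random
from collections import defaultdict


def stratified_split(labels, test_size=0.2, seed=0):
    """Hypothetical helper: return (train_idx, test_idx) with each class
    represented in roughly the same proportion in both parts."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_size)
        # For classes with 2+ samples, keep at least one sample on each side.
        if len(indices) > 1:
            n_test = min(max(n_test, 1), len(indices) - 1)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Imbalanced toy labels (illustrative data only): 90 negatives, 10 positives.
labels = ["neg"] * 90 + ["pos"] * 10
train_idx, test_idx = stratified_split(labels, test_size=0.2, seed=42)
```

In practice, scikit-learn's `train_test_split` offers the same behavior through its `stratify` argument, for example `train_test_split(X, y, test_size=0.2, stratify=y, random_state=42)`; the sketch above only shows the underlying idea.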