Data processing and ML with Designer
- Clean GitHub code data for LLM training
- LLM data processing: Wikipedia
- LLM data processing: arXiv
- LLM Data Processing: Alpaca-CoT (SFT Data)
- Processing Alpaca-CoT SFT data with DLC components
- LLM data processing for GitHub code (DLC)
- Image-text filtering
- Video data filtering and labeling
- A CTR prediction solution with offline-online consistency
- Heart disease prediction
- Classify news based on text analysis
- Predict agricultural loan eligibility
- Discretize continuous features with the Binning component
- Predict student exam scores
- Automatic similar tag classification
- Air quality prediction
- Predict power plant output
- Power theft identification
- Offline scheduling