Files
ucsinfer/TODO.md
2025-09-26 20:52:08 -07:00

38 lines
681 B
Markdown

## Recommend
- Use History when adding catids
## Gather
- Maybe more dataset configurations
## Qualify
- Print stats for a dataset
## Fine-tune
- https://www.sbert.net/docs/sentence_transformer/loss_overview.html#loss-table
- Use (anchor, positive) pairs to train a new model
- Use (sentence) + class labels to train a new model
- Implement BatchAllTripletLoss
- Train with anchored definitions and/or...
- Train with class labels
## Evaluate
- Print more information about the dataset coverage of UCS
- Allow skipping model testing for this
- Print raw output
<!-- - Maybe load everything into a sqlite for slicker reporting -->
## Utility
- Clear caches