htsvoice files are voice models produced and used by the [HTS] [vocal-synthesis] framework. They are generated by training tagged input audio from a speaker/singer and can be used with HTS and tools built on top of it to generate speech or singing waveforms.
Unfortunately, this does not seem to be a widely used or available format. There is one widely available model (hts-voice-nitech-jp-atr503-m001
) and a few (literally 2-3) other sites online that I found with others available for free download.
HTS voices are models that are trained, which according to someone on the [utau] forums requires "around 4 hours of english recordings to do a decent HMM voice"