The Ubuntu training corpus

The Ubuntu training corpus is an enormous (3 GB) collection of conversational text from Ubuntu's tech-support system.

It's heavily biased and hopelessly noisy, but it should make a decent starting point.
