Preparing a chatbot training dataset: converting a famous writer's txt files into [input, target] format

I have a pretrained chatbot model. According to its developer documentation, it can be fine-tuned on custom data in the following format: [{"input": input_query, "target": target_query}, ...] Each item requires two parts: an input and a target. I currently have a lot of txt files containing the works of a famous writer (someone as well known in literature as Da Vinci or Picasso is in art). What I want to achieve is chatting with a bot that responds in the style of that writer, i.e. a chatbot trained for a specific style or personality. The question is: how do I convert the txt files (the writer's works) into a dataset in [input, target] format? Any comments are welcome.
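To make the target format concrete, here is a minimal sketch of one possible heuristic: split each text into sentences and treat every sentence as the input and the sentence that follows it as the target. This pairing scheme is an assumption for illustration only, not something taken from the model's documentation, and the function names (`split_sentences`, `build_pairs`, `convert_files`) are hypothetical. The sentence splitter is deliberately naive and will mis-split abbreviations and quoted dialogue.

```python
import json
import re

# Sentence-ending punctuation, ASCII and CJK (assumed sufficient for a rough split).
_SENTENCE = re.compile(r"[^.!?。！？]+[.!?。！？]*")


def split_sentences(text):
    """Naively split raw text into sentences, dropping empty fragments."""
    return [s.strip() for s in _SENTENCE.findall(text) if s.strip()]


def build_pairs(sentences):
    """Pair each sentence (input) with the sentence that follows it (target)."""
    return [
        {"input": prev, "target": nxt}
        for prev, nxt in zip(sentences, sentences[1:])
    ]


def convert_files(paths):
    """Read UTF-8 txt files and return one combined list of input/target items."""
    dataset = []
    for path in paths:
        with open(path, encoding="utf-8") as f:
            dataset.extend(build_pairs(split_sentences(f.read())))
    return dataset


def save_dataset(dataset, out_path):
    """Write the dataset as a JSON list, keeping non-ASCII characters readable."""
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(dataset, f, ensure_ascii=False, indent=2)
```

A call such as `save_dataset(convert_files(["book1.txt"]), "train.json")` (file names are placeholders) would produce a JSON list of `{"input": ..., "target": ...}` items. Whether consecutive-sentence pairs actually yield conversational behaviour in the writer's style is an open question; pairing a short instruction with a passage of the writer's text as the target is another option that may be worth trying.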

The pretrained model I am using: https://github.com/clue-ai/ChatYuan

Answers

No answers yet.



