Recently, it was reported that ByteDance was using OpenAI (an artificial intelligence research company based in the United States) technology to develop its own large language model, violating OpenAI's terms of service. In response, the relevant person in charge of ByteDance responded, "When using OpenAI-related services, the company emphasizes that it must comply with its terms of use." We are also in contact with OpenAI to clarify possible misunderstandings caused by external reports. ”
The person in charge also introduced the introduction of ByteDance's use of OpenAI services
First of all, at the beginning of this year, when the technical team first started to explore large models, some engineers applied the API (Application Programming Interface) service of GPT (Generative Pre-trained Transformer Model) to experimental project research on smaller models. The model is for testing only, with no plans to go live, and has never been used externally. This practice has been discontinued after the company introduced GPT API call specification checking in April.
Secondly, in April this year, the Byte Big Model team has put forward clear internal requirements that the data generated by the GPT model must not be added to the training dataset of the Byte Big Model, and train the engineer team to comply with the terms of service when using GPT.
Again, in September, Bytes conducted another round of internal inspections and took measures to further ensure that API calls to GPT meet the requirements of the specification. For example, the similarity between the training data of the model and GPT is detected by sampling in batches to prevent data annotators from using GPT without permission.
Finally, in the coming days, ByteDance will conduct another comprehensive inspection to ensure strict compliance with the terms of use of the relevant services.
Edited by Song Yuting.
Proofread by Wu Xingfa.