However, they express concerns about their limited programming skills and the potential quality of their results. They are seeking feedback on whether this project is feasible and if the results, despite potentially being poor, would be sufficient to measure differences in the LLM's output based on the training data.
Key takeaways:
- The author is an IB diploma candidate in high school planning to write a research paper on how an LLM's training data impacts its output.
- The author plans to compare the output of an LLM trained on different sources, such as Wikipedia and Reddit.
- The author has access to powerful Nvidia GPUs and plenty of time for training the LLM.
- Despite having some technical skills, the author admits to having weak programming abilities and is seeking advice on the feasibility of the project.