We use GPT-4 to robotically write explanations for the habits of neurons in huge language fashions and to attain the ones explanations. We liberate a dataset of those (imperfect) explanations and ratings for each and every neuron in GPT-2.
We use GPT-4 to robotically write explanations for the habits of neurons in huge language fashions and to attain the ones explanations. We liberate a dataset of those (imperfect) explanations and ratings for each and every neuron in GPT-2.