how can I avoid this problem?
as far as I understand it is because of the tokens per min. limit but is there a way to avoid it? would it fix the problem if I add a limit to the "Input tokens per minute limit" from the settings? shouldn't it wait if the limit is exceeded? what would be the best approach?