In its raw frequency form, tf is just the number of times "this" occurs in each document. In each document, the word "this" appears once; but because document 2 has more words, its relative frequency is smaller.

Because Dataset.repeat does not signal where one epoch ends and the next begins, a Dataset.batch applied right after Dataset.repeat will produce batches that straddle epoch boundaries.
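
The term-frequency comparison can be checked with a few lines of Python. This is a minimal sketch: the two document texts and the helper names raw_tf and relative_tf are assumptions made for illustration, since the passage does not give the documents themselves.

```python
from collections import Counter


def raw_tf(term, tokens):
    """Raw term frequency: how many times `term` occurs in the token list."""
    return Counter(tokens)[term]


def relative_tf(term, tokens):
    """Raw count divided by the document length (0.0 for an empty document)."""
    return raw_tf(term, tokens) / len(tokens) if tokens else 0.0


# Hypothetical documents, chosen so that "this" occurs once in each
# and document 2 is the longer of the two.
doc1 = "this is a a sample".split()
doc2 = "this is another another example example example".split()

for name, doc in (("document 1", doc1), ("document 2", doc2)):
    print(name, raw_tf("this", doc), round(relative_tf("this", doc), 3))
```

With these assumed texts the raw count is 1 in both documents, while the relative frequency drops from 1/5 in document 1 to 1/7 in document 2.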
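
The effect of batching after repeating can be illustrated without TensorFlow. The sketch below is not tf.data code: repeat and batch are hypothetical stand-ins written with itertools that only mimic the ordering behavior, on an assumed three-element dataset.

```python
from itertools import chain, islice


def repeat(items, epochs):
    """Concatenate `epochs` passes over `items` with no marker between passes."""
    return chain.from_iterable(items for _ in range(epochs))


def batch(iterable, size):
    """Group consecutive elements into lists of at most `size` elements."""
    it = iter(iterable)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


data = [0, 1, 2]  # one epoch of a hypothetical dataset

# Repeat first, then batch: the middle batch mixes two epochs.
repeat_then_batch = list(batch(repeat(data, epochs=2), size=2))
print(repeat_then_batch)

# Batch first, then repeat: every batch stays inside a single epoch.
batch_then_repeat = list(repeat(list(batch(data, size=2)), epochs=2))
print(batch_then_repeat)
```

In the first ordering the batch [2, 0] contains the last element of one epoch and the first element of the next. Batching before repeating keeps epochs separate, at the cost of a short final batch in each epoch.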