See all comments (3)
Top ten is just shy of a hundred million of the patterns, 67% of all patterns. Top 100 is 83%. In total, there are only 64 296 unique patterns, which means that each pattern on average occurs 2 333 times. 62 327 of the patterns were valid in the sense that Node.js could parse them as a regex without errors.
,这一点在PDF资料中也有详细论述
2026-03-09 00:00:00:03014415810http://paper.people.com.cn/rmrb/pc/content/202603/09/content_30144158.htmlhttp://paper.people.com.cn/rmrb/pad/content/202603/09/content_30144158.html11921 贵州奋力打造“多彩贵州”文旅新品牌
01:42, 9 марта 2026Мир
Numbered or labeled points dressed up as continuous prose. The model writes what is essentially a listicle but wraps each point in a paragraph that starts with "The first... The second... The third..." to disguise the format. Perhaps you told it to stop generating lists and it decided to do this instead... still very common.