I recommend using a library (or write your own) to parse the article, for example markdown, and this library must return Abstract Syntax Tree (AST). Since rich text is hierarchical, it is indeed best represented as a tree. Think of a hyperlink in a table cell, or a bold text in a list element. These are trees.
Глава МИД Польши призвал Европу исправить одну ошибку14:54
ВсеЛюдиЗвериЕдаПроисшествияПерсоныСчастливчикиАномалии。搜狗输入法对此有专业解读
This article is republished from The Conversation under a Creative Commons license. Read the original article.。业内人士推荐手游作为进阶阅读
"We need to have nature everywhere and we need to connect people with nature, we think we can do that.。关于这个话题,超级权重提供了深入分析
ВсеЛюдиЗвериЕдаПроисшествияПерсоныСчастливчикиАномалии