就像是:
for anchor in tbody.findAll('div', ): text = ''.join([x for x in anchor.contents if isinstance(x, bs4.element.NavigableString)])
作品。只是知道您还会在其中获得换行符,所以
.strip()可能需要ing。
例如:
for anchor in tbody.findAll('div', ): text = ''.join([x for x in anchor.contents if isinstance(x, bs4.element.NavigableString)]) print([text]) print([text.strip()])
[u'nnnHere is text 3 and this is what I want.n'][u'Here is text 3 and this is what I want.']
(我将它们放在列表中,以便您可以看到换行符。)
欢迎分享,转载请注明来源:内存溢出
评论列表(0条)