将文本分为3个均匀大小的组的算法

将文本分为3个均匀大小的组的算法,第1张

将文本分为3个均匀大小的组的算法

同样来自Wikipedia上有关自动换行的文章的“最小衣衫program”动态程序可以适应您的需求。设置

LineWidth = len(text)/n -1
并忽略有关超过行宽的无限惩罚的注释;与一起使用
c(i, j)
as 的定义
P = 2


Code. I took the liberty of modifying the DP always to return exactly n lines, at the cost of increasing the running time from O(#words ** 2) to O(#words ** 2 * n).def minragged(text, n=3):    """    >>> minragged('Just testing to see how this works.')    ['Just testing', 'to see how', 'this works.']    >>> minragged('Just testing to see how this works.', 10)    ['', '', 'Just', 'testing', 'to', 'see', 'how', 'this', 'works.', '']    """    words = text.split()    cumwordwidth = [0]    # cumwordwidth[-1] is the last element    for word in words:        cumwordwidth.append(cumwordwidth[-1] + len(word))    totalwidth = cumwordwidth[-1] + len(words) - 1  # len(words) - 1 spaces    linewidth = float(totalwidth - (n - 1)) / float(n)  # n - 1 line breaks    def cost(i, j):        """        cost of a line words[i], ..., words[j - 1] (words[i:j])        """        actuallinewidth = max(j - i - 1, 0) + (cumwordwidth[j] - cumwordwidth[i])        return (linewidth - float(actuallinewidth)) ** 2    # best[l][k][0] is the min total cost for words 0, ..., k - 1 on l lines    # best[l][k][1] is a minimizing index for the start of the last line    best = [[(0.0, None)] + [(float('inf'), None)] * len(words)]    # xrange(upper) is the interval 0, 1, ..., upper - 1    for l in xrange(1, n + 1):        best.append([])        for j in xrange(len(words) + 1): best[l].append(min((best[l - 1][k][0] + cost(k, j), k) for k in xrange(j + 1)))    lines = []    b = len(words)    # xrange(upper, 0, -1) is the interval upper, upper - 1, ..., 1    for l in xrange(n, 0, -1):        a = best[l][b][1]        lines.append(' '.join(words[a:b]))        b = a    lines.reverse()    return linesif __name__ == '__main__':    import doctest    doctest.testmod()


欢迎分享,转载请注明来源:内存溢出

原文地址: http://outofmemory.cn/zaji/5643211.html

(0)
打赏 微信扫一扫 微信扫一扫 支付宝扫一扫 支付宝扫一扫
上一篇 2022-12-16
下一篇 2022-12-16

发表评论

登录后才能评论

评论列表(0条)

保存