用python快速过滤html指定标签函数

967 阅读 0 评论 0 点赞

用python快速过滤html指定标签函数

"""
@author: MR.N
@created: 2022/3/30 Wed.
@version: 1.0
"""
 
import io
import re
 
 
def filter_html_tags(text):
    htmltags = ['div', 'ul', 'li', 'ol', 'p', 'span', 'form', 'br',
                'h1', 'h2', 'h3', 'h4', 'h5', 'h6',
                'hr', 'input',
                'title', 'table', 'tbody', 'a',
                'i', 'strong', 'b', 'big', 'small', 'u', 's', 'strike',
                'img', 'center', 'dl', 'dt', 'font', 'em',
                'code', 'pre', 'link', 'meta', 'iframe', 'ins']
    blocktags = ['script', 'style']
    tabletags = ['tr', 'th', 'td']
    for tag in htmltags:
        # filter html tag with its attribute descriptions
        text = re.sub(f'<{tag}[^<>]*[/]?>', '', text)
        text = re.sub(f'</{tag}>', '', text)
    # '''
    for block in blocktags:
        re_block = re.compile('<\s*{block}[^>]*>[\S\s]*?<\s*/\s*{block}\s*>',re.I)#script
        text = re_block.sub('',text) #

    buffer = io.StringIO(text)
    text = ''
    line = buffer.readline()
    while line is not None and line != '':
        for tag in tabletags:
            if '<' + tag in line or '</' + tag in line:
                if len(line) < 2:
                    # len('\n') == 1
                    if ascii(line) == '\\n':
                        line = ''
                while '\n' in line:
                    line = line.replace('\n', '')
                line = re.sub(f'<{tag}[^<>]*[/]?>', '', line)
                line = re.sub(f'</{tag}>', '', line)
                # filter multiple spaces
                line = line.replace(' ', '')
        text += line
        line = buffer.readline()
    # '''
 
    # filter multiple empty lines
    while '\n\n' in text:
        text = text.replace("\n\n", '\n')
    return text

点赞(0) 打赏

本文分类：PYTHON编程
本文标签：无
浏览次数：967 次浏览
发布日期：2023-08-15 00:13:00
本文链接：http://yelongauto.com/index.php/PYTHONbiancheng/2072.html

评论列表共有 0 条评论

暂无评论

发表评论取消回复

微信小程序

微信扫一扫体验

微信公众账号

微信扫一扫加关注

发表
评论返回
顶部

基本文件流程错误 SQL 调试

/www/wwwroot/new.yelongauto.com/public/index.php ( 0.88 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/start.php ( 0.72 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/base.php ( 2.60 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Loader.php ( 21.07 KB )
/www/wwwroot/new.yelongauto.com/vendor/composer/autoload_static.php ( 10.16 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/deprecation-contracts/function.php ( 0.98 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/polyfill-php80/bootstrap.php ( 1.50 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/polyfill-mbstring/bootstrap.php ( 7.33 KB )
/www/wwwroot/new.yelongauto.com/vendor/ralouphie/getallheaders/src/getallheaders.php ( 1.60 KB )
/www/wwwroot/new.yelongauto.com/vendor/guzzlehttp/guzzle/src/functions_include.php ( 0.16 KB )
/www/wwwroot/new.yelongauto.com/vendor/guzzlehttp/guzzle/src/functions.php ( 5.55 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/polyfill-php73/bootstrap.php ( 0.99 KB )
/www/wwwroot/new.yelongauto.com/vendor/ezyang/htmlpurifier/library/HTMLPurifier.composer.php ( 0.10 KB )
/www/wwwroot/new.yelongauto.com/vendor/topthink/think-helper/src/helper.php ( 2.88 KB )
/www/wwwroot/new.yelongauto.com/vendor/karsonzhang/fastadmin-addons/src/common.php ( 15.07 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Route.php ( 60.23 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Config.php ( 6.38 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Hook.php ( 4.71 KB )
/www/wwwroot/new.yelongauto.com/vendor/overtrue/wechat/src/Kernel/Support/Helpers.php ( 2.54 KB )
/www/wwwroot/new.yelongauto.com/vendor/overtrue/wechat/src/Kernel/Helpers.php ( 1.89 KB )
/www/wwwroot/new.yelongauto.com/vendor/topthink/think-captcha/src/helper.php ( 1.94 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Validate.php ( 41.60 KB )
/www/wwwroot/new.yelongauto.com/vendor/topthink/think-queue/src/common.php ( 1.19 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Console.php ( 23.13 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Error.php ( 3.75 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/convention.php ( 10.37 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/App.php ( 21.58 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Request.php ( 49.78 KB )
/www/wwwroot/new.yelongauto.com/application/config.php ( 12.03 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Env.php ( 1.21 KB )
/www/wwwroot/new.yelongauto.com/application/database.php ( 2.25 KB )
/www/wwwroot/new.yelongauto.com/application/extra/addons.php ( 1.62 KB )
/www/wwwroot/new.yelongauto.com/application/extra/queue.php ( 0.55 KB )
/www/wwwroot/new.yelongauto.com/application/extra/site.php ( 0.99 KB )
/www/wwwroot/new.yelongauto.com/application/extra/upload.php ( 0.81 KB )
/www/wwwroot/new.yelongauto.com/application/tags.php ( 1.23 KB )
/www/wwwroot/new.yelongauto.com/application/common.php ( 15.57 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/helper.php ( 17.41 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Debug.php ( 7.13 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Log.php ( 6.05 KB )
/www/wwwroot/new.yelongauto.com/addons/addondev/Addondev.php ( 1.44 KB )
/www/wwwroot/new.yelongauto.com/vendor/karsonzhang/fastadmin-addons/src/Addons.php ( 7.05 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/Cms.php ( 6.48 KB )
/www/wwwroot/new.yelongauto.com/addons/crontab/Crontab.php ( 1.94 KB )
/www/wwwroot/new.yelongauto.com/addons/baidupush/Baidupush.php ( 1.99 KB )
/www/wwwroot/new.yelongauto.com/addons/epay/Epay.php ( 2.39 KB )
/www/wwwroot/new.yelongauto.com/addons/recharge/Recharge.php ( 2.49 KB )
/www/wwwroot/new.yelongauto.com/addons/vip/Vip.php ( 4.60 KB )
/www/wwwroot/new.yelongauto.com/addons/withdraw/Withdraw.php ( 1.79 KB )
/www/wwwroot/new.yelongauto.com/addons/nkeditor/Nkeditor.php ( 1.20 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Cache.php ( 6.10 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/cache/driver/File.php ( 7.27 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/cache/Driver.php ( 5.98 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/View.php ( 6.77 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/view/driver/Think.php ( 5.64 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Template.php ( 44.92 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/template/driver/File.php ( 2.24 KB )
/www/wwwroot/new.yelongauto.com/addons/addondev/library/ClassLoader.php ( 0.82 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/config.php ( 33.94 KB )
/www/wwwroot/new.yelongauto.com/application/common/behavior/Common.php ( 3.02 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Lang.php ( 7.42 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/lang/zh-cn.php ( 11.81 KB )
/www/wwwroot/new.yelongauto.com/application/route.php ( 0.90 KB )
/www/wwwroot/new.yelongauto.com/vendor/karsonzhang/fastadmin-addons/src/addons/Route.php ( 2.76 KB )
/www/wwwroot/new.yelongauto.com/application/common/lang/zh-cn/addon.php ( 6.09 KB )
/www/wwwroot/new.yelongauto.com/extend/fast/Form.php ( 39.79 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/config/driver/Ini.php ( 0.83 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Url.php ( 12.72 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/controller/Archives.php ( 5.94 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/controller/Base.php ( 3.75 KB )
/www/wwwroot/new.yelongauto.com/vendor/karsonzhang/fastadmin-addons/src/addons/Controller.php ( 6.49 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Controller.php ( 6.07 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/traits/controller/Jump.php ( 4.92 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/http-foundation/IpUtils.php ( 6.62 KB )
/www/wwwroot/new.yelongauto.com/vendor/symfony/polyfill-php80/Php80.php ( 3.49 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/lang/zh-cn.php ( 5.58 KB )
/www/wwwroot/new.yelongauto.com/application/common/library/Auth.php ( 15.22 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Cookie.php ( 7.54 KB )
/www/wwwroot/new.yelongauto.com/application/common/model/Config.php ( 6.71 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Model.php ( 69.55 KB )
/www/wwwroot/new.yelongauto.com/addons/vip/config.php ( 2.63 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/library/Service.php ( 29.34 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Db.php ( 6.67 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/connector/Mysql.php ( 3.89 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/Connection.php ( 29.97 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/Query.php ( 93.80 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/builder/Mysql.php ( 4.53 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/Builder.php ( 31.81 KB )
/www/wwwroot/new.yelongauto.com/addons/epay/library/Service.php ( 18.17 KB )
/www/wwwroot/new.yelongauto.com/addons/epay/config.php ( 2.25 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Archives.php ( 21.81 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/traits/model/SoftDelete.php ( 4.86 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/model/relation/BelongsTo.php ( 7.75 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/model/relation/OneToOne.php ( 10.03 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/model/Relation.php ( 3.61 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Channel.php ( 18.63 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Modelx.php ( 1.97 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Fields.php ( 3.46 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/SpiderLog.php ( 1.75 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Tag.php ( 5.62 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Autolink.php ( 0.57 KB )
/www/wwwroot/new.yelongauto.com/application/common/model/User.php ( 3.95 KB )
/www/wwwroot/new.yelongauto.com/runtime/temp/b0a402f264a1f2e352d9f07507ba57fb.php ( 36.60 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/db/Expression.php ( 1.11 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Session.php ( 10.86 KB )
/www/wwwroot/new.yelongauto.com/extend/fast/Tree.php ( 15.60 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/model/Collection.php ( 2.27 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Collection.php ( 11.10 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/model/Comment.php ( 9.47 KB )
/www/wwwroot/new.yelongauto.com/addons/cms/library/Bootstrap.php ( 5.49 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Paginator.php ( 9.94 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/Response.php ( 8.28 KB )
/www/wwwroot/new.yelongauto.com/thinkphp/library/think/debug/Html.php ( 4.17 KB )

[ BEHAVIOR ] Run Closure @app_init [ RunTime:0.000046s ]
[ CACHE ] INIT File
[ BEHAVIOR ] Run \addons\addondev\Addondev @app_init [ RunTime:0.000573s ]
[ BEHAVIOR ] Run \addons\cms\Cms @app_init [ RunTime:0.000392s ]
[ BEHAVIOR ] Run \addons\crontab\Crontab @app_init [ RunTime:0.000201s ]
[ BEHAVIOR ] Run Closure @app_init [ RunTime:0.000278s ]
[ BEHAVIOR ] Run app\common\behavior\Common @app_init [ RunTime:0.000154s ]
[ LANG ] /www/wwwroot/new.yelongauto.com/thinkphp/lang/zh-cn.php
[ BEHAVIOR ] Run app\common\behavior\Common @app_dispatch [ RunTime:0.000059s ]
[ ROUTE ] array ( 'type' => 'method', 'method' => array ( 0 => '\\think\\addons\\Route', 1 => 'execute', ), 'var' => array ( 'addon' => 'cms', 'controller' => 'archives', 'action' => 'index', ), )
[ HEADER ] array ( 'host' => 'yelongauto.com', 'accept-encoding' => 'gzip, br, zstd, deflate', 'user-agent' => 'Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)', 'accept' => '*/*', 'content-length' => '', 'content-type' => '', )
[ PARAM ] array ( 'catename' => 'PYTHONbiancheng', 'id' => '2072', )
[ RUN ] think\addons\Route->execute[ /www/wwwroot/new.yelongauto.com/vendor/karsonzhang/fastadmin-addons/src/addons/Route.php ]
[ LANG ] /www/wwwroot/new.yelongauto.com/public/../application/common/lang/zh-cn/addon.php
[ BEHAVIOR ] Run app\common\behavior\Common @addon_begin [ RunTime:0.000355s ]
[ LANG ] /www/wwwroot/new.yelongauto.com/addons/cms/lang/zh-cn.php
[ BEHAVIOR ] Run \addons\vip\Vip @upload_config_init [ RunTime:0.000225s ]
[ DB ] INIT mysql
[ BEHAVIOR ] Run \addons\epay\Epay @addon_action_begin [ RunTime:0.001025s ]
[ VIEW ] /www/wwwroot/new.yelongauto.com/addons/cms/view/default/show_news.html [ array ( 0 => 'config', 1 => 'user', 2 => 'site', 3 => '__CHANNEL__', 4 => 'isWechat', 5 => '__ARCHIVES__', 6 => '__MODEL__', ) ]
[ SESSION ] INIT array ( 'id' => '', 'var_session_id' => '', 'prefix' => 'think', 'expire' => 360000, 'type' => '', 'auto_start' => true, )
[ BEHAVIOR ] Run \addons\cms\Cms @view_filter [ RunTime:0.000208s ]

0.770078s

ShowPageTrace