Http-equiv meta tag

yurj · January 18, 2022, 7:51am

Hi!

Where does the <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" /> in Plone pages come from? We would like to drop it and use <meta charset="utf-8" /> instead.

fredvd · January 18, 2022, 10:52am

That's interesting, the meta header in main_template was already changed in 2017, but that was on master:

github.com

plone/Products.CMFPlone/blob/07574b0412460914fb5f1829638c0f9c4c084b4d/Products/CMFPlone/browser/templates/main_template.pt#L27

    
      
          checkPermission python:context.restrictedTraverse('portal_membership').checkPermission;
          site_properties python:context.restrictedTraverse('portal_properties').site_properties;
          ajax_include_head python:request.get('ajax_include_head', False);
          ajax_load python:False;"
      i18n:domain="plone"
      tal:attributes="lang lang;">

    <metal:cache tal:replace="structure provider:plone.httpheaders" />

    <head>
      <meta charset="utf-8" />

      <div tal:replace="structure provider:plone.htmlhead" />

      <tal:comment replace="nothing">
        Various slots where you can insert elements in the header from a template.
      </tal:comment>
      <metal:topslot define-slot="top_slot" />
      <metal:headslot define-slot="head_slot" />
      <metal:styleslot define-slot="style_slot" />

Plone 5.2.x branch still has the content attribute as well:

github.com

plone/Products.CMFPlone/blob/00a7993512fbc5201e8ab70d0251a98f6be042ed/Products/CMFPlone/browser/templates/main_template.pt#L26

    
      
          checkPermission python:context.restrictedTraverse('portal_membership').checkPermission;
          site_properties python:context.restrictedTraverse('portal_properties').site_properties;
          ajax_include_head python:request.get('ajax_include_head', False);
          ajax_load python:False;"
      i18n:domain="plone"
      tal:attributes="lang lang;">

    <metal:cache tal:replace="structure provider:plone.httpheaders" />

    <head>
      <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

      <div tal:replace="structure provider:plone.htmlhead" />

      <tal:comment replace="nothing">
        Various slots where you can insert elements in the header from a template.
      </tal:comment>
      <metal:topslot define-slot="top_slot" />
      <metal:headslot define-slot="head_slot" />
      <metal:styleslot define-slot="style_slot" />

yurj · January 18, 2022, 11:17am

If you render a page in Plone 6, you will see the http-equiv meta tag. It is not in the main_template, I suppose.

tmassman · January 18, 2022, 11:25am

This is coming from the theme: plonetheme.barceloneta/index.html at a0df3545a5d6c2dd0198fe30ceea96e9eddf5db9 · plone/plonetheme.barceloneta · GitHub

It was changed 3 months ago.

yurj · January 18, 2022, 11:57am

If you try to remove it or do a custom theme, it is still there.

The page above is a demo, the real index is this:

https://github.com/plone/plonetheme.barceloneta/blob/master/plonetheme/barceloneta/theme/index.html and that is the one I'm running. When the page is rendered, the http-equiv tag still gets in. Maybe it comes from Zope 5.3.0?

yurj · January 24, 2022, 3:53pm

It is really strange: if you grep for 'http-equiv', it is used in just one ZMI template and read in one other place, but nowhere else. It does not appear anywhere in Plone.

github.com/plone/plonetheme.barceloneta

Duplicate html charset declaration

opened 03:47PM - 24 Jan 22 UTC

giulioturetta

If you render a page in Plone 6, you will see a duplicate charset meta element:

```html
<head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<title>My site title</title>
[...]
<meta charset="utf-8" />
[...]
</head>
```

As stated in https://www.w3.org/International/questions/qa-html-encoding-declarations, _"The declaration should fit completely within the first 1024 bytes at the start of the file, so it's best to put it immediately after the opening head tag"_.

Expected behaviour: have only one `<meta charset="utf-8" />` element right after the opening `head` tag.
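For checking a rendered page against that recommendation, here is a minimal sketch using only Python's standard library. The names `CharsetDeclarationFinder`, `find_charset_declarations` and `charset_report` are made up for this illustration and are not part of Plone; the 1024-byte limit is the one quoted from the W3C page.

```python
from html.parser import HTMLParser


class CharsetDeclarationFinder(HTMLParser):
    """Collect every <meta> element that declares a character encoding."""

    def __init__(self):
        super().__init__()
        self.declarations = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <meta ... />.
        if tag != "meta":
            return
        attrs = dict(attrs)
        http_equiv = (attrs.get("http-equiv") or "").lower()
        if "charset" in attrs or http_equiv == "content-type":
            self.declarations.append(attrs)


def find_charset_declarations(html_text):
    finder = CharsetDeclarationFinder()
    finder.feed(html_text)
    finder.close()
    return finder.declarations


def charset_report(html_bytes, limit=1024):
    """Count encoding declarations and check whether one appears early enough."""
    everything = html_bytes.decode("utf-8", errors="replace")
    # Parse only the first `limit` bytes to see what is declared there.
    beginning = html_bytes[:limit].decode("utf-8", errors="replace")
    return {
        "count": len(find_charset_declarations(everything)),
        "within_limit": len(find_charset_declarations(beginning)) > 0,
    }


if __name__ == "__main__":
    page = (
        b'<head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />'
        b'<title>My site title</title><meta charset="utf-8" /></head>'
    )
    print(charset_report(page))
```

A `count` above 1 points to duplicate declarations like the ones in the issue, and a false `within_limit` means no declaration was found in the first 1024 bytes.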

yurj · January 30, 2022, 3:42pm

I've found this:

https://lxml.de/api/lxml.html-module.html

tostring(doc, pretty_print=False, include_meta_content_type=False, encoding=None, method='html', with_tail=True, doctype=None)

Return an HTML string representation of the document.

Note: if include_meta_content_type is true this will create a <meta http-equiv="Content-Type" ...> tag in the head; regardless of the value of include_meta_content_type any existing <meta http-equiv="Content-Type" ...> tag will be removed

but the default seems False. But maybe worth a check?

fredvd · January 31, 2022, 3:43pm

but the default seems False. But maybe worth a check?

Sigh. That could very well be true and is worth checking. I thought briefly about Diazo when replying but left it out of my first response.

I had an issue years ago where <!DOCTYPE html> appeared twice in the html or conflicted with the html4/xhtml version, which was also added by lxml.

[edit: fix escaping of html in the text]

iham · April 12, 2022, 7:34am

@yurj pointed me here after I filed an issue, because the doctype and namespace don't seem to match...

iham · April 12, 2022, 7:39am

OK, a little digging: HTML html xmlns Attribute

"The xmlns attribute is required in XHTML, invalid in HTML 4.01, and optional in HTML5."
and
"This is because the namespace "xmlns=http://www.w3.org/1999/xhtml" is default, and will be added to the <html> tag even if you do not include it."

So it's OK to have it there even in HTML5. I'll close my ticket.

petschki · November 24, 2022, 3:27pm

Well ... some investigations on the http-equiv meta tag:

I finally landed, via plone.transformchain, in the module repoze.xmliter.serializer, where an lxml.etree object simply gets stringified (repoze.xmliter/serializer.py at master · repoze/repoze.xmliter · GitHub). Before this, the result of [e.attrib for e in self.tree.xpath("//meta")] is:

[
    {'charset': 'utf-8'},
    {'name': 'viewport', 'content': 'width=device-width, initial-scale=1.0'},
    {'name': 'generator', 'content': 'Plone - https://plone.org/'}
]

After str(self.tree), the result starts with:

'<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" class="h-100" lang="de" xml:lang="de">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
    <title>
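To compare that serialized string with the attribute list taken from the tree, the same meta inspection can be repeated on the output. This is a rough sketch with the standard library's xml.etree.ElementTree, assuming the output is well-formed XHTML as the doctype suggests; `meta_attributes` and the shortened `SAMPLE` document are invented for the example.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"


def meta_attributes(xhtml_text):
    """Return the attribute dict of every <meta> element in an XHTML string."""
    root = ET.fromstring(xhtml_text)
    # Elements live in the XHTML namespace because of the xmlns attribute.
    return [dict(el.attrib) for el in root.iter("{%s}meta" % XHTML_NS)]


# Shortened, hypothetical stand-in for the serialized page.
SAMPLE = """<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" class="h-100" lang="de" xml:lang="de">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<title>Example</title>
<meta charset="utf-8" />
</head>
<body></body>
</html>"""

if __name__ == "__main__":
    for attrs in meta_attributes(SAMPLE):
        print(attrs)
```

Any http-equiv entry that shows up here but not in the list taken from the tree was introduced at the stringification step.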

So the http-equiv was added to the tree. Googling about this, the only result I found was this comment on Stack Overflow (python - Not to escape attribute contents in lxml (python3)? - Stack Overflow), where it says:

...The str(result) approach does work but has several problems:
(1) It automatically adds <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"> (NOT a valid xhtml) just after the <head> tag;...

Wow ... but I do not really know how to get rid of this. I think I'll file an issue on the lxml repository and see what the response is.
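In the meantime, one conceivable stopgap is to post-process the serialized string and drop the injected element, in the same spirit as the lxml.html.tostring note quoted earlier in the thread about removing existing http-equiv tags. This is only a sketch: `drop_http_equiv_meta` is a hypothetical helper, not something Plone, repoze.xmliter, or lxml provides, and where it would hook into the transform chain is left open.

```python
import re

# Matches <meta http-equiv="Content-Type" ...>, self-closing or not,
# plus the rest of its line so no empty line is left behind.
_META_CONTENT_TYPE = re.compile(
    r'<meta\s+http-equiv="Content-Type"[^>]*>[ \t]*\n?',
    re.IGNORECASE,
)


def drop_http_equiv_meta(serialized):
    """Remove http-equiv Content-Type meta elements from serialized markup."""
    return _META_CONTENT_TYPE.sub("", serialized)


if __name__ == "__main__":
    # Hypothetical snippet modelled on the output shown in this thread.
    page = (
        "<head>\n"
        '<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />\n'
        "    <title>Example</title>\n"
        '    <meta charset="utf-8" />\n'
        "</head>"
    )
    print(drop_http_equiv_meta(page))
```

A regex on the final string is crude, and it leaves the `<meta charset="utf-8" />` wherever the template put it, so it does not by itself move the declaration to just after the opening head tag.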