[Expo-tech] !DOCTYPE & charset=

Philip Sargent (Gmail) philip.sargent at gmail.com
Fri Dec 20 18:01:35 GMT 2019


More fund with grep.

Since it’s 9 years since HTML5 and CSS3 came out, I think we can agree that they are stable enough now for Expo use, so long as we are careful not to be too bold with them.

 

$ cd expoweb

$ grep -ahr DOCTYPE * | sort | uniq -c | sort -n

      1 <!DOCTYPE html>

      1 <!DOCTYPE HTML>

      1 <!DOCTYPE html PUBLIC "-//IETF//DTD HTML 4.0//EN">

      1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"

      1 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

      1 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

      2 <!DOCTYPE html><html>

      1 <!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN"

      2 <!DOCTYPE svg PUBLIC "-//W3C//DTD SVG 1.1//EN" "http://www.w3.org/Graphics/SVG/1.1/DTD/svg11.dtd" [

      6 <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0//EN">

      8 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">

      9 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

     10 <!DOCTYPE html>

     10 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

     30 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"

    380 <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

    437 <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0//EN">

    920 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN">

 

 

plus these

      1 dplong.write("""<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">

      1 print $out ("<!DOCTYPE html PUBLIC \"-//W3C//DTD XHTML 1.0 Strict//EN\" \"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd\">

      1 print $triplist ("<!DOCTYPE html PUBLIC \"-//W3C//DTD XHTML 1.0 Strict//EN\" \"http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd\">

1                 this.oSmileyPopupWindow.document.write('<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">\n<html>');

and

 

$ grep -ahr "meta http-equiv=" * | sort | uniq -c | grep charset |sort –n

 

      1 <head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

      1 htmlfile.write("<html><head><meta http-equiv=\"Content-type\" content=\"text/html;charset=UTF-8\"><title>Prospecting Guide</title>\n")

      1 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">

      1                 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

      1 <meta http-equiv="Content-Type" content="text/html;charset=ISO-8859-1">

      1 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"><title>Expo 2007: who and when</title>

      1 <meta http-equiv="content-type" content="text/html;charset=utf-8" />

      1 <meta http-equiv="Content-type" content="text/html;charset=UTF-8">

      1 <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

      1 <meta http-equiv="Content-Type" content="text/html; charset=utf8" />

      1 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

      1         <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />

      1         <meta http-equiv="CONTENT-TYPE" content="text/html; charset=utf-8">

      2 <meta http-equiv="Content-Type" content= "text/html; charset=iso-8859-1" />

      2 <meta http-equiv="Content-Type" content="text/html; charset=us-ascii" />

      2 <meta http-equiv=\"Content-Type\" content=\"text/html; charset=UTF-8\">

      3 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />

      4 <html><head><meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

      6 <meta http-equiv="content-type" content="text/html; charset=UTF-8" />

      7     <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />

      9         <meta http-equiv="CONTENT-TYPE" content="text/html; charset=utf-8">

     10                 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

     10 <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

     12 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

     15 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">

     45 <head><meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

    772 <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />

    921 <meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/>

 

tidying up ISO-8859-1 & UTF-8 will mostly be a matter of handling umlauts I imagine.

The site is split pretty evenly between the two it seems.

 

I *presume* that we should standardise on UTF-8 but I’m all ears for alternative suggestions. (I see we have a “utf8” crept in there).

Similarly I would assume that standardising on <!DOCTYPE HTML> is uncontentious ?

 

I do enjoy a bit of software archaeology. (Does anyone else here follow  <https://twitter.com/arclight> @arclight ?)

 

 

 

Philip

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.wookware.org/pipermail/expo-tech/attachments/20191220/72c818dc/attachment-0001.html>


More information about the Expo-tech mailing list