Re: "Readability" feature in eww

From: Rasmus
Subject: Re: "Readability" feature in eww
Date: Mon, 03 Nov 2014 12:15:00 +0100
User-agent: Gnus/5.130012 (Ma Gnus v0.12) Emacs/24.3.94 (gnu/linux)

Rüdiger Sonderfeld <address@hidden> writes:

> On Monday 03 November 2014 01:41:14 Lars Magne Ingebrigtsen wrote:
>> This is a heuristic, of course, so it can be tweaked endlessly.  The
>> current algorithm just gives most words a positive score, HTML markup a
>> negative score, and words inside <a> tags a negative score.  For such a
>> simple algorithm, it seems to give pretty good results.
>> But tweaking is necessary for it to be ... better.  If anybody has ideas
>> for tweaks or better algorithms, please be my guest and have at it.
> HTML5 has introduced tags such as <main> and <article>, which can be used to
> identify the important parts.  I'm not sure how widespread their use is thus
> far (I think org-mode supports it already if one sets the HTML5 export
> option).

Indeed, but HTML5 is not the default.  As far as I remember you'd have to
wrap your article part in #+begin_article ⋯ #+end_article.  There was
a discussion at some point, and there were some good HTML5 reasons why
the body is not wrapped in <article> by default.
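
To make the two ideas in this thread concrete, here is a rough Python
sketch (not the eww code, which is Emacs Lisp) of the kind of scoring
Lars describes, with Rüdiger's <main>/<article> hint layered on top.
The weights, the class names and the readable_part function are all
assumptions made up for illustration; only the general shape (words
score positive, markup and link text score negative) comes from the
description quoted above.

```python
from html.parser import HTMLParser

# Assumed weights, chosen only for illustration; eww's real values may differ.
WORD, LINK_WORD, TAG = 1, -2, -3
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}


class Node:
    """One element of the parsed page (hypothetical helper class)."""

    def __init__(self, tag, attrs=(), parent=None):
        self.tag, self.attrs, self.parent = tag, dict(attrs), parent
        self.items = []   # child Nodes and text strings, in document order
        self.own = 0      # score contributed directly by this element

    def total(self):
        # Score of the whole subtree rooted at this element.
        return self.own + sum(i.total() for i in self.items
                              if isinstance(i, Node))

    def text(self):
        parts = [i.text() if isinstance(i, Node) else i for i in self.items]
        return " ".join(p for p in parts if p)

    def walk(self):
        yield self
        for i in self.items:
            if isinstance(i, Node):
                yield from i.walk()


class TreeBuilder(HTMLParser):
    """Builds a scored tree while parsing."""

    def __init__(self):
        super().__init__()
        self.root = self.cur = Node("#root")
        self.in_link = 0  # how many <a> elements are currently open

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.cur)
        node.own += TAG                      # markup scores negative
        self.cur.items.append(node)
        if tag not in VOID:                  # void elements have no content
            self.cur = node
            self.in_link += tag == "a"

    def handle_endtag(self, tag):
        target = self.cur
        while target is not self.root and target.tag != tag:
            target = target.parent
        if target is self.root:              # stray closing tag: ignore it
            return
        while self.cur is not target.parent:  # close unclosed children too
            self.in_link -= self.cur.tag == "a"
            self.cur = self.cur.parent

    def handle_data(self, data):
        words = data.split()
        if words:
            weight = LINK_WORD if self.in_link else WORD
            self.cur.own += len(words) * weight
            self.cur.items.append(" ".join(words))


def readable_part(html):
    """Return the best-scoring element, preferring <main>/<article>."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    nodes = list(builder.root.walk())
    marked = [n for n in nodes if n.tag in ("main", "article")]
    return max(marked or nodes, key=Node.total)
```

Calling readable_part(page).text() on a page's HTML string gives the
text of the chosen element.  Restricting the search to <main> and
<article> when they exist is one possible way to use the HTML5 tags;
another would be to give them a score bonus and let the heuristic
decide.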


. . . The proofs are technical in nature and provides no real understanding
