[www] 10/19: split up template.py, make site generation a class.

gnunet-svn
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[www] 10/19: split up template.py, make site generation a class.

From:	gnunet
Subject:	[www] 10/19: split up template.py, make site generation a class.
Date:	Wed, 13 Nov 2019 01:28:42 +0100
This is an automated email from the git hooks/post-receive script.

ng0 pushed a commit to branch master
in repository www.

commit 98690996582e0e511a8e8611b3db5080be6e75b0
Author: ng0 <address@hidden>
AuthorDate: Tue Nov 12 17:28:30 2019 +0000

    split up template.py, make site generation a class.
---
 inc/fileproc.py |  85 +++++++++++++++++
 inc/site.py     | 127 ++++++++++++++++++++++++
 inc/sitemap.py  |  20 ++++
 inc/sum.py      |  35 +++++++
 inc/textproc.py |  35 +++++++
 template.py     | 292 ++++----------------------------------------------------
 www.yml         |  48 +++++++---
 7 files changed, 354 insertions(+), 288 deletions(-)

diff --git a/inc/fileproc.py b/inc/fileproc.py
new file mode 100644
index 0000000..435078b
--- /dev/null
+++ b/inc/fileproc.py
@@ -0,0 +1,85 @@
+from pathlib import Path
+
+def copy_files(kind, conf, locale, inlist, ptarget):
+    o = Path(ptarget)
+    for item in conf[inlist]:
+        i = Path(kind + "/" + item["file"])
+        # print(i)
+        for t in item["targets"]:
+            d_loc = o / locale / t
+            d = o / t
+            # print(d)
+            if i.is_file() is not False:
+                d_loc.write_text(i.read_text())
+                print("copied " + str(i) + " to " + str(d_loc) + "...")
+                d.write_text(i.read_text())
+                print("copied " + str(i) + " to " + str(d) + "...")
+
+
+def rm_rf(directory):
+    directory = Path(directory)
+    for child in directory.glob('*'):
+        if child.is_file():
+            child.unlink()
+        else:
+            rm_rf(child)
+    # directory.rmdir()
+
+
+def fileop(infile, outfile, action):
+    """
+    infile: inputfile, Path object
+    outfile: outputfile, Path object
+    action: action if any, String
+    """
+    i = Path(infile)
+    o = Path(outfile)
+    outdir = Path("rendered")
+    if i.is_file() is not False:
+        if action == "copy":
+            # Write content of i to o.
+            o.write_text(i.read_text())
+        if action == "link":
+            o.symlink_to(i)
+
+
+def write_name(filename, infile, locale, replacer):
+    return "./rendered/" + locale + "/" + infile.replace(replacer,
+                                                         '').rstrip(".j2")
+
+
+def localized(filename, locale, *args):
+    if len(args) == 0:
+        return "../" + locale + "/" + filename
+    ext = kwargs.get('ext', None)
+    if ext is not None:
+        lf = filename + "." + locale + "." + ext
+        lp = Path(lf)
+        if locale == "en" or not lp.is_file():
+            return "../" + filename + "." + ext
+        else:
+            return "../" + lf
+
+
+# This generates and switches sites generations, preventing
+# in-place modification of the website.
+# * save old generation directory name
+# * jinja2 creates content in "rendered" (happened before calling this 
function)
+# * calculate sum of "rendered"
+# * move "rendered" to out/$sum
+# * remove symlink "html_dir"
+# * symlink out/$sum to "html_dir"
+# * delete old generation directory
+def generation_dir(htmldir):
+    oldgen = Path(htmldir).resolve()
+    # precondition: jinja2 has created the files in "rendered".
+    newgen = Path("rendered")
+    newgen_sum = walksum(newgen)
+    outdir = Path("out")
+    outdir.mkdir(parents=True, exist_ok=True)
+    newgen_target = Path("out") / newgen_sum
+    newgen.rename(newgen_target)
+    html = Path(htmldir)
+    html.unlink()
+    fileop(newgen, html, "link")
+    rm_rf(oldgen)
diff --git a/inc/site.py b/inc/site.py
new file mode 100644
index 0000000..67e5afa
--- /dev/null
+++ b/inc/site.py
@@ -0,0 +1,127 @@
+import os
+import os.path
+import sys
+import re
+import gettext
+import glob
+import codecs
+import jinja2
+import hashlib
+from pathlib import Path, PurePosixPath
+from ruamel.yaml import YAML
+import inc.i18nfix
+from inc.textproc import cut_news_text
+from inc.fileproc import copy_files
+
+
+class gen_site:
+    def load_config(self, name="www.yml"):
+        yaml = YAML(typ='safe')
+        site_configfile = Path(name)
+        return yaml.load(site_configfile)
+
+    def gen_abstract(self, conf, name, member, pages, length):
+        for item in conf[name]:
+            item[member] = cut_news_text(item[pages], length)
+
+    def run(self, root, conf, env):
+        # os.chdir("..")
+        print(os.getcwd())
+        root = "../" + root
+        for in_file in glob.glob(root + "/*.j2"):
+            name, ext = re.match(r"(.*)\.([^.]+)$",
+                                 in_file.rstrip(".j2")).groups()
+            tmpl = env.get_template(in_file)
+
+            def self_localized(other_locale):
+                """
+                Return URL for the current page in another locale.
+                """
+                return "../" + other_locale + "/" + in_file.replace(
+                    root + '/', '').rstrip(".j2")
+
+            def url_localized(filename):
+                if root == "news":
+                    return "../../" + locale + "/" + filename
+                else:
+                    return "../" + locale + "/" + filename
+
+            def url_static(filename):
+                if root == "news":
+                    return "../../static/" + filename
+                else:
+                    return "../static/" + filename
+
+            def url_dist(filename):
+                if root == "news":
+                    return "../../dist/" + filename
+                else:
+                    return "../dist/" + filename
+
+            def svg_localized(filename):
+                lf = filename + "." + locale + ".svg"
+                if locale == "en" or not Path(lf).is_file():
+                    return "../" + filename + ".svg"
+                else:
+                    return "../" + lf
+
+            def url(x):
+                # TODO: look at the app root environment variable
+                # TODO: check if file exists
+                #if root == "news":
+                #    return "../" + "../" + x
+                #else:
+                #    return "../" + x
+                return "../" + x
+
+            for l in glob.glob("locale/*/"):
+                locale = os.path.basename(l[:-1])
+
+                tr = gettext.translation("messages",
+                                         localedir="locale",
+                                         languages=[locale])
+
+                tr.gettext = i18nfix.wrap_gettext(tr.gettext)
+
+                env.install_gettext_translations(tr, newstyle=True)
+
+                content = tmpl.render(lang=locale,
+                                      lang_full=conf["langs_full"][locale],
+                                      url=url,
+                                      meetingnotesdata=conf["meetingnotes"],
+                                      newsdata=conf["newsposts"],
+                                      videosdata=conf["videoslist"],
+                                      self_localized=self_localized,
+                                      url_localized=url_localized,
+                                      url_static=url_static,
+                                      url_dist=url_dist,
+                                      svg_localized=svg_localized,
+                                      filename=name + "." + ext)
+
+                if root == "news":
+                    out_name = "./rendered/" + locale + "/" + root + "/" + 
in_file.replace(
+                        root + '/', '').rstrip(".j2")
+                else:
+                    out_name = "./rendered/" + locale + "/" + in_file.replace(
+                        root + '/', '').rstrip(".j2")
+
+                outdir = Path("rendered")
+                if outdir.exists() is False:
+                    sys.exit(1)
+
+                if root == "news":
+                    langdir = outdir / locale / root
+                else:
+                    langdir = outdir / locale
+
+                try:
+                    langdir.mkdir(parents=True, exist_ok=True)
+                except e as FileNotFoundError:
+                    print(e)
+
+                with codecs.open(out_name, "w", encoding='utf-8') as f:
+                    try:
+                        print(Path.cwd())
+                        f.write(content)
+                    except e as Error:
+                        print(e)
diff --git a/inc/sitemap.py b/inc/sitemap.py
new file mode 100644
index 0000000..e050c77
--- /dev/null
+++ b/inc/sitemap.py
@@ -0,0 +1,20 @@
+import os
+from pathlib import Path, PurePosixPath
+
+def sitemap_tree(path):
+    tree = dict(name=PurePosixPath(path).name, children=[])
+    try:
+        mylist = os.listdir(path)
+    except OSError:
+        pass
+    else:
+        for name in mylist:
+            fn = os.path.join(path, name)
+            if os.path.isdir(fn):
+                tree['children'].append(sitemap_tree(fn))
+            else:
+                np = os.path.join(name)
+                if np.startswith('/'):
+                    np = np[1:]
+                tree['children'].append(dict(name=np))
+    return tree
diff --git a/inc/sum.py b/inc/sum.py
new file mode 100644
index 0000000..9addf78
--- /dev/null
+++ b/inc/sum.py
@@ -0,0 +1,35 @@
+def sha256sum(_):
+    sha256 = hashlib.sha256()
+    with io.open(_, mode="rb") as fd:
+        content = fd.read()
+        sha256.update(content)
+    return sha256.hexdigest()
+
+
+def walksum(_):
+    sha256 = hashlib.sha256()
+    x = Path(_)
+    if not x.exists():
+        return -1
+    try:
+        for root, directories, files in os.walk(_):
+            for names in sorted(files):
+                filepath = os.path.join(root, names)
+                try:
+                    fl = open(filepath, 'rb')
+                except:
+                    fl.close()
+                    continue
+                while 1:
+                    buf = fl.read(4096)
+                    if not buf:
+                        break
+                    sha256.update(hashlib.sha256(buf).hexdigest())
+                fl.close()
+    except:
+        import traceback
+        traceback.print_exc()
+        return -2
+    return sha256.hexdigest()
+
+
diff --git a/inc/textproc.py b/inc/textproc.py
new file mode 100644
index 0000000..228518b
--- /dev/null
+++ b/inc/textproc.py
@@ -0,0 +1,35 @@
+import html.parser
+from bs4 import BeautifulSoup
+
+class extractText(html.parser.HTMLParser):
+    def __init__(self):
+        super(extractText, self).__init__()
+        self.result = []
+    def handle_data(self, data):
+        self.result.append(data)
+    def text_in(self):
+        return ''.join(self.result)
+
+
+def html2text(html):
+    k = extractText()
+    k.feed(html)
+    return k.text_in()
+
+
+def cut_text(filename, count):
+    with open(filename) as html:
+        soup = BeautifulSoup(html, features="lxml")
+        for script in soup(["script", "style"]):
+            script.extract()
+        k = []
+        for i in soup.findAll('p')[1]:
+            k.append(i)
+        b = ''.join(str(e) for e in k)
+        text = html2text(b.replace("\n", ""))
+        textreduced = (text[:count] + '...') if len(text) > count else (text + 
'..')
+        return(textreduced)
+
+
+def cut_news_text(filename, count):
+    return cut_text("news/" + filename + ".j2", count)
diff --git a/template.py b/template.py
index 89e4856..c4a17e2 100755
--- a/template.py
+++ b/template.py
@@ -16,289 +16,35 @@
 #
 # Note that the gettext files need to be prepared first. This script
 # is thus to be invoked via the Makefile.
-#
-# We import unicode_literals until people have understood how unicode
-# with bytes and strings changed in python2->python3.
-# from __future__ import unicode_literals
-import os
-import os.path
-import sys
-import re
-import gettext
-import glob
-import codecs
-import jinja2
-import i18nfix
-from pathlib import Path
-import hashlib
-from bs4 import BeautifulSoup
-from ruamel.yaml import YAML
-import html.parser
 
+import jinja2
+import os
+from inc.site import gen_site
+from inc.fileproc import copy_files
 
-env = jinja2.Environment(loader=jinja2.FileSystemLoader(
-    os.path.dirname(__file__)),
+env = 
jinja2.Environment(loader=jinja2.FileSystemLoader(os.path.dirname(__file__),"inc"),
                          extensions=["jinja2.ext.i18n"],
                          lstrip_blocks=True,
                          trim_blocks=True,
                          undefined=jinja2.StrictUndefined,
                          autoescape=False)
 
-
-class extractText(html.parser.HTMLParser):
-    def __init__(self):
-        super(extractText, self).__init__()
-        self.result = []
-    def handle_data(self, data):
-        self.result.append(data)
-    def text_in(self):
-        return ''.join(self.result)
-
-
-def html2text(html):
-    k = extractText()
-    k.feed(html)
-    return k.text_in()
-
-
-def localized(filename, locale, *args):
-    if len(args) == 0:
-        return "../" + locale + "/" + filename
-    ext = kwargs.get('ext', None)
-    if ext is not None:
-        lf = filename + "." + locale + "." + ext
-        lp = Path(lf)
-        if locale == "en" or not lp.is_file():
-            return "../" + filename + "." + ext
-        else:
-            return "../" + lf
-
-
-def fileop(infile, outfile, action):
-    """
-    infile: inputfile, Path object
-    outfile: outputfile, Path object
-    action: action if any, String
-    """
-    i = Path(infile)
-    o = Path(outfile)
-    outdir = Path("rendered")
-    if i.is_file() is not False:
-        if action == "copy":
-            # Write content of i to o.
-            o.write_text(i.read_text())
-        if action == "link":
-            o.symlink_to(i)
-
-
-def write_name(filename, infile, locale, replacer):
-    return "./rendered/" + locale + "/" + infile.replace(replacer,
-                                                         '').rstrip(".j2")
-
-
-def sha256sum(_):
-    sha256 = hashlib.sha256()
-    with io.open(_, mode="rb") as fd:
-        content = fd.read()
-        sha256.update(content)
-    return sha256.hexdigest()
-
-
-def walksum(_):
-    sha256 = hashlib.sha256()
-    x = Path(_)
-    if not x.exists():
-        return -1
-    try:
-        for root, directories, files in os.walk(_):
-            for names in sorted(files):
-                filepath = os.path.join(root, names)
-                try:
-                    fl = open(filepath, 'rb')
-                except:
-                    fl.close()
-                    continue
-                while 1:
-                    buf = fl.read(4096)
-                    if not buf:
-                        break
-                    sha256.update(hashlib.sha256(buf).hexdigest())
-                fl.close()
-    except:
-        import traceback
-        traceback.print_exc()
-        return -2
-    return sha256.hexdigest()
-
-
-def rm_rf(directory):
-    directory = Path(directory)
-    for child in directory.glob('*'):
-        if child.is_file():
-            child.unlink()
-        else:
-            rm_rf(child)
-    directory.rmdir()
-
-
-# This generates and switches sites generations, preventing
-# in-place modification of the website.
-# * save old generation directory name
-# * jinja2 creates content in "rendered" (happened before calling this 
function)
-# * calculate sum of "rendered"
-# * move "rendered" to out/$sum
-# * remove symlink "html_dir"
-# * symlink out/$sum to "html_dir"
-# * delete old generation directory
-def generation_dir(htmldir):
-    oldgen = Path(htmldir).resolve()
-    # precondition: jinja2 has created the files in "rendered".
-    newgen = Path("rendered")
-    newgen_sum = walksum(newgen)
-    outdir = Path("out")
-    outdir.mkdir(parents=True, exist_ok=True)
-    newgen_target = Path("out") / newgen_sum
-    newgen.rename(newgen_target)
-    html = Path(htmldir)
-    html.unlink()
-    fileop(newgen, html, "link")
-    rm_rf(oldgen)
-
-
-def copy_static(locale, indict):
-    for key, value in indict.items():
-        print(locale + "/" + key + " ...to... " + locale + "/" + value)
-
-
-def preview_text(filename, count):
-    with open(filename) as html:
-        # html = open(filename).read()
-        soup = BeautifulSoup(html, features="lxml")
-        for script in soup(["script", "style"]):
-            script.extract()
-        k = []
-        # for i in soup.findAll('p')[1:3]:
-        for i in soup.findAll('p')[1]:
-            k.append(i)
-        b = ''.join(str(e) for e in k)
-        text = html2text(b.replace("\n", ""))
-        textreduced = (text[:count] + '...') if len(text) > count else (text + 
'..')
-        return(textreduced)
-
-
-def abstract_news(filename, count):
-    return preview_text("news/" + filename + ".j2", count)
-
-
-def generate_site(root, conf):
-    for in_file in glob.glob(root + "/*.j2"):
-        name, ext = re.match(r"(.*)\.([^.]+)$", in_file.rstrip(".j2")).groups()
-        tmpl = env.get_template(in_file)
-
-        def self_localized(other_locale):
-            """
-            Return URL for the current page in another locale.
-            """
-            return "../" + other_locale + "/" + in_file.replace(
-                root + '/', '').rstrip(".j2")
-
-        def url_localized(filename):
-            if root == "news":
-                return "../../" + locale + "/" + filename
-            else:
-                return "../" + locale + "/" + filename
-
-        def url_static(filename):
-            if root == "news":
-                return "../../static/" + filename
-            else:
-                return "../static/" + filename
-
-        def url_dist(filename):
-            if root == "news":
-                return "../../dist/" + filename
-            else:
-                return "../dist/" + filename
-
-        def svg_localized(filename):
-            lf = filename + "." + locale + ".svg"
-            if locale == "en" or not os.path.isfile(lf):
-                return "../" + filename + ".svg"
-            else:
-                return "../" + lf
-
-        def url(x):
-            # TODO: look at the app root environment variable
-            # TODO: check if file exists
-            #if root == "news":
-            #    return "../" + "../" + x
-            #else:
-            #    return "../" + x
-            return "../" + x
-
-        for l in glob.glob("locale/*/"):
-            locale = os.path.basename(l[:-1])
-
-            tr = gettext.translation("messages",
-                                     localedir="locale",
-                                     languages=[locale])
-
-            tr.gettext = i18nfix.wrap_gettext(tr.gettext)
-
-            env.install_gettext_translations(tr, newstyle=True)
-
-            content = tmpl.render(lang=locale,
-                                  lang_full=conf["langs_full"][locale],
-                                  url=url,
-                                  meetingnotesdata=conf["meetingnotes"],
-                                  newsdata=conf["newsposts"],
-                                  videosdata=conf["videoslist"],
-                                  self_localized=self_localized,
-                                  url_localized=url_localized,
-                                  url_static=url_static,
-                                  url_dist=url_dist,
-                                  svg_localized=svg_localized,
-                                  filename=name + "." + ext)
-
-            if root == "news":
-                out_name = "./rendered/" + locale + "/" + root + "/" + 
in_file.replace(
-                    root + '/', '').rstrip(".j2")
-            else:
-                out_name = "./rendered/" + locale + "/" + in_file.replace(
-                    root + '/', '').rstrip(".j2")
-
-            outdir = Path("rendered")
-
-            if root == "news":
-                langdir = outdir / locale / root
-            else:
-                langdir = outdir / locale
-
-            langdir.mkdir(parents=True, exist_ok=True)
-
-            with codecs.open(out_name, "w", encoding='utf-8') as f:
-                f.write(content)
-
-
 def main():
     # rm_rf("rendered")
-    yaml=YAML(typ='safe')
-    site_configfile=Path("www.yml")
-    conf=yaml.load(site_configfile)
-
-    for item in conf["newsposts"]:
-        item['abstract'] = abstract_news(item['page'], 1000)
-    print("generating template")
-    generate_site("template", conf)
-    print("generating news")
-    generate_site("news", conf)
-
-#    for l in glob.glob("locale/*/"):
-#        locale = os.path.basename(l[:-1])
-#        copy_static (locale, staticfiles)
-# generate_rss
-#print("running generation")
-#generation_dir
+    x = gen_site()
+    conf = x.load_config("www.yml")
+    print("generating news abstracts...")
+    x.gen_abstract(conf, "newsposts", "abstract", "page", 1000)
+    print("generating html from jinja2 templates...")
+    x.run("template", conf, env)
+    print("generating html from jinja2 news templates...")
+    x.run("news", conf, env)
+    #for lang in conf["langs_full"]:
+    #    copy_files("static", conf, lang, "staticfiles", "rendered")
+    # print("generating rss...")
+    # x.generate_rss()
+    # print("generating sitemap...")
+    # x.generate_sitemap()
 
 if __name__ == "__main__":
     main()
diff --git a/www.yml b/www.yml
index 4694373..ce0fa3e 100644
--- a/www.yml
+++ b/www.yml
@@ -5,22 +5,40 @@ langs_full: &langs
     es: Español
     de: Deutsch
 symlinks:
-    frontpage.html: frontpage
-    gsoc.html: gsoc
-    about.html: philosophy
-    gns.html: gns
-    node/about.html: "397"
-# Mostly from static/ to rendered/
+    - file: frontpage.html
+      targets:
+        - frontpage
+    - file: gsoc.html
+      targets:
+        - gsoc
+    - file: about.html
+      targets:
+        - philosophy
+    - file: gns.html
+      targets:
+        - gns
+    - file: node/about.html
+      targets:
+        - "397"
 staticfiles:
-    favicon.ico: favicon.ico
-    moved.html: frontpage.html
-    robots.txt:
-      - static
-      - dist
-      - *langs
-    moved_gsoc.html: gsoc.html
-    moved_about.html: about.html
-    moved_gns.html: gns.html
+    - file: favicon.ico
+      targets:
+        - favicon.ico
+    - file: moved.html
+      targets:
+        - frontpage.html
+    - file: robots.txt
+      targets:
+        - robots.txt
+    - file: moved_gsoc.html
+      targets:
+        - gsoc.html
+    - file: moved_about.html
+      targets:
+        - about.html
+    - file: moved_gns.html
+      targets:
+        - gns.html
 meetingnotes:
   2013: 2013-12-27
   2014: 2014-12-28

-- 
To stop receiving notification emails like this one, please contact
address@hidden.
[Prev in Thread]
Current Thread
[Next in Thread]
[www] 01/19: drop guix-env.scm, no longer maintained., (continued)
- [www] 01/19: drop guix-env.scm, no longer maintained., gnunet, 2019/11/12
- [www] 06/19: remove FIXME., gnunet, 2019/11/12
- [www] 09/19: move jinja2 ext to inc., gnunet, 2019/11/12
- [www] 07/19: remove taler flyer, leftover from initial base., gnunet, 2019/11/12
- [www] 15/19: build-system, gnunet, 2019/11/12
- [www] 14/19: add submodule for www_shared -> inc., gnunet, 2019/11/12
- [www] 08/19: minor syntax fixes in engage., gnunet, 2019/11/12
- [www] 16/19: inc, new commits, gnunet, 2019/11/12
- [www] 13/19: move inc to www_shared repo, gnunet, 2019/11/12
- [www] 12/19: inc: more logic to remove code from Makefile. news/index.hmtl.j2: syntax., gnunet, 2019/11/12
- [www] 10/19: split up template.py, make site generation a class., gnunet <=
- [www] 17/19: use make_sitemap.py, gnunet, 2019/11/12
- [www] 19/19: make build more silent again., gnunet, 2019/11/12
- [www] 18/19: inc, gnunet, 2019/11/12
- [www] 11/19: rename template.py to make_site.py, gnunet, 2019/11/12
Prev by Date: [www] 12/19: inc: more logic to remove code from Makefile. news/index.hmtl.j2: syntax.
Next by Date: [www] 17/19: use make_sitemap.py
Previous by thread: [www] 12/19: inc: more logic to remove code from Makefile. news/index.hmtl.j2: syntax.
Next by thread: [www] 17/19: use make_sitemap.py
Index(es):
- Date
- Thread