lynx-dev

[Lynx-dev] Extract links from html with application/ld+json script


From: Super Bonaci
Subject: [Lynx-dev] Extract links from html with application/ld+json script
Date: Sun, 17 Dec 2023 19:31:33 +0000 (UTC)

Version in use: Lynx Version 2.8.9rel.1 (08 Jul 2018)

Some html pages contain <script type="application/ld+json"> content, for 
example:

wget -E 'https://www.twitch.tv/egctv/videos?filter=all&sort=time' -O twitch.html

Whether the HTML is embedded or not depends on which wget or curl flags are 
used.

The twitch.html sample can be browsed here:
https://controlc.com/9ed7a8bb
https://pastebin.com/87edaepd

Lynx is not able to extract most of the links contained in this HTML file.

The Lynx version I am using dates from 2018, so it may simply be too old to 
support newer formats such as this one.
Could this issue be fixed?
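
In the meantime, here is a minimal sketch of a possible workaround outside 
Lynx. It assumes the missing links are stored as plain URL strings inside the 
<script type="application/ld+json"> blocks of the saved page. The names 
JsonLdCollector, iter_urls and extract_jsonld_urls are made up for this 
illustration, and nothing here is a Lynx feature. It uses only the Python 
standard library.

```python
import json
import sys
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the raw text of every <script type="application/ld+json"> block."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        # The type attribute may be absent or valueless, hence the "or ''".
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def iter_urls(node):
    """Walk decoded JSON and yield every string that looks like an http(s) URL."""
    if isinstance(node, dict):
        for key, value in node.items():
            if key == "@context":
                continue  # vocabulary reference, not a link to follow
            yield from iter_urls(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_urls(item)
    elif isinstance(node, str) and node.startswith(("http://", "https://")):
        yield node


def extract_jsonld_urls(html_text):
    """Return the unique URLs found in all JSON-LD blocks, in document order."""
    collector = JsonLdCollector()
    collector.feed(html_text)
    collector.close()
    urls = []
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip a malformed block instead of aborting
        for url in iter_urls(data):
            if url not in urls:
                urls.append(url)
    return urls


if __name__ == "__main__" and len(sys.argv) > 1:
    # Example: python3 this_script.py twitch.html
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        for found in extract_jsonld_urls(fh.read()):
            print(found)
```

Run against the saved twitch.html, this would print one URL per line. It only 
covers links held in JSON-LD data, so ordinary <a href> links would still have 
to be listed by other means.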

bye.

