diff options
Diffstat (limited to 'www/blog/2023-01-06')
-rw-r--r-- | www/blog/2023-01-06/index.html | 184 |
1 files changed, 184 insertions, 0 deletions
diff --git a/www/blog/2023-01-06/index.html b/www/blog/2023-01-06/index.html new file mode 100644 index 0000000..e5a113b --- /dev/null +++ b/www/blog/2023-01-06/index.html @@ -0,0 +1,184 @@ +<!DOCTYPE html> +<html lang="en"> +<head> + <meta charset="utf-8"> + <meta name="generator" + content="HTML Tidy for HTML5 for Linux version 5.6.0"> + <title>senders.io - How I Generate My RSS Feed</title> + <link rel='stylesheet' + type='text/css' + href='/index.css'> + <meta name="viewport" + content="width=device-width, initial-scale=1"> +</head> +<body> + <div id='header'> + <a class='title' + href='/'>senders.io</a> + <nav> + <a href="/blog">blog</a> <a rel="external noopener noreferrer" + target="_blank" + href="https://github.com/s3nd3r5">github</a> <a rel= + "external noopener noreferrer" + target="_blank" + href="https://git.senders.io">cgit</a> <a rel= + "me external noopener noreferrer" + target="_blank" + href="https://mastodon.online/@senders">mastodon</a> + </nav> + </div> + <div id="body"> + <article> + <h1>How I Generate My RSS Feed</h1> + <p>I only just now started supplying an RSS feed to you fine people! You + can subscribe to it at <a href= + "/blog/feed.rss">www.senders.io/blog/feed.rss</a>!</p> + <p>I decided rather than manually generating the file contents I’d hook + into my pre-existing publish scripts to be able to generate the RSS + file.</p> + <h2>Publishing blog posts - shell scripts ftw</h2> + <p>In <a href="/blog/2022-11-06/">My Markdown -> HTML Setup</a> I + touch on how I publish my markdown files into HTML for this blog. But + what I don’t <em>really</em> touch on is the shell scripts that tie the + whole process together.</p> + <p>What I have is two, now three, scripts that feed the whole + process:</p> + <ol> + <li><code>publish-blog.sh</code> - the main script</li> + <li><code>compile-md.sh</code> - generates the HTML output</li> + <li><code>update-feed.sh</code> - generates/appends the RSS feed</li> + </ol> + <p>The <code>update-feed.sh</code> script is the new one I just + added.</p> + <p><code>publish-blog.sh</code> is the primary interface, I supply the + date of the post and the path to the md file and that calls compile and + update to automate the entire process.</p> + <p>Without going into TOO much detail you can view the latest versions of + the scripts at <a rel="external noopener noreferrer" + target="_blank" + href= + "https://git.senders.io/senders/senders-io/tree/">git.senders.io/senders/senders-io/tree/</a>.</p> + <p>But the gist of the scripts is I parse out the necessary details, + find/replace some tokens in template files I have setup for headers and + footers, and concat the outputs into the final output HTML files, and now + RSS feed.</p> + <h3>update-feed.sh</h3> + <p>Source File: <a rel="external noopener noreferrer" + target="_blank" + href= + "https://git.senders.io/senders/senders-io/tree/update-feed.sh">git.senders.io/senders/senders-io/tree/update-feed.sh</a></p> + <p>This script is pretty interesting. I didn’t want to deal with any XML + parsers and libraries to just maintain a proper XML rss file and push + items into the tree. Rather, I just follow a similar setup to my markdown + generation. I leverage some temporary files to hold the contents, a + static temp file for the previously generated content, and at the end + swap the temp file with the real file.</p> + <p>I take in an input of the publish date (this is the date from the + publish script), the title, and the HTML file path. These are all already + variables in the publish script, but also something I can manually supply + if I need to publish an older article, or something I wrote directly in + HTML.</p> + <p>The core of the script is found here:</p> + <pre><code>PUBDATE=$(date -d "$1" -R) +TITLE=$2 +FILE_PATH=$3 +PERMALINK=$(echo "${FILE_PATH}" | sed -e "s,${TKN_URL_STRIP},${URL_PREFIX},g") +LINK=$(echo "${PERMALINK}" | sed -e "s,${TKN_INDEX_STRIP},,g") + +# Generate TMP FEED File Header + +cat -s $FILE_RSS_HEADER > $FILE_TMP_FEED +sed -i -E "s/${TKN_BUILDDATE}/${BUILDDATE}/g" $FILE_TMP_FEED +sed -i -E "s/${TKN_PUBDATE}/${PUBDATE}/g" $FILE_TMP_FEED + +# Generate TMP Item File + +cat -s $FILE_RSS_ITEM_HEADER > $FILE_TMP_ITEM +sed -i -E "s~${TKN_TITLE}~${TITLE}~g" $FILE_TMP_ITEM +sed -i -E "s/${TKN_PUBDATE}/${PUBDATE}/g" $FILE_TMP_ITEM +sed -i -E "s,${TKN_PERMALINK},${PERMALINK},g" $FILE_TMP_ITEM +sed -i -E "s,${TKN_LINK},${LINK},g" $FILE_TMP_ITEM +sed -n "/<article>/,/<\/article>/p" $FILE_PATH >> $FILE_TMP_ITEM +cat -s $FILE_RSS_ITEM_FOOTER >> $FILE_TMP_ITEM + +# Prepend Item to items list and overwrite items file w/ prepended item +## In order to "prepend" the item (so it's on top of the others) +## We need to concat the tmp item file with the existing list, then +## we can push the contents over the existing file +## We use cat -s to squeeze the blank lines +cat -s $FILE_ITEM_OUTPUT >> $FILE_TMP_ITEM +cat -s $FILE_TMP_ITEM > $FILE_ITEM_OUTPUT + +# Push items to TMP FEED +cat -s $FILE_ITEM_OUTPUT >> $FILE_TMP_FEED + +# Push RSS footer to TMP FEED +cat -s $FILE_RSS_FOOTER >> $FILE_TMP_FEED +echo $FILE_TMP_FEED + +# Publish feed +cat -s $FILE_TMP_FEED > $FILE_RSS_OUTPUT + +echo "Finished generating feed" +</code></pre> + <p>Some key takeaways are:</p> + <ol> + <li>sed lets you do regex with delimiters that AREN’T <code>/</code> so + you can substitute something that shouldn’t actually ever show up in + your regex. For me that is <code>~</code>.</li> + <li>I always forget you can use sed to extract between tokens - which + is how I get the CDATA for the RSS: <code>sed -n + "/<article>/,/<\/article>/p"</code></li> + <li><code>mktemp</code> is really REALLY useful - and I feel is under + utilized in shellscripting</li> + </ol> + <p>The obvious cracks are:</p> + <ol> + <li>I rely SO much on <code>sed</code> that it’s almost certainly going + to break</li> + <li>I don’t have much other flag control to do partial generation - so + if I need to do something either starting partway through or not finish + the full process, I don’t have that.</li> + <li>Sometimes things can break silently and it will go through, there + is no verification or like manual checking along the way before + publishing the feed.rss</li> + </ol> + <p>The final two can easily be managed by writing the feed to a location + that isn’t a temp file and I can manually do the <code>cat -s + $FILE_TMP_FEED > www/blog/feed.rss</code> myself after I check it + over.</p> + <p>But for now I’ll see if I ever have to redo it. I don’t think anyone + will actually sub to this so I don’t really need to care that much if I + amend the feed.</p> + <h2>Where to put the feed URL</h2> + <p>I never intended to provide an RSS feed. I doubt anyone but me reads + this, and from my previous experience with gemini feed generation was a + bit of a headache.</p> + <p>A quick aside: I really only decided thanks to Mastodon. I was + thinking during the Twitter meltdown “what if twitter but RSS” (I know + super unique idea). But basically like a true “microblog”. And some OSS + tools to publish your blog. This got me reading the RSS spec and looking + into it more - which then lead me down the using the RSS readers more (in + conjunction with gemini, and Cortex podcast talking about using RSS + more).</p> + <p>But I’ve decided to just put the RSS feed in the blog index, on my + homepage, and that’s it. I don’t need it permanently in the header.</p> + <h2>Conclusion</h2> + <p>I didn’t have much to share here, it doesn’t make too much sense to + write a big post on what can be explained better by just checking out the + shell scripts in my git source. The code speaks better than I ever + could.</p> + <p>I really, really like shell scripting.</p> + </article> + <div id="footer"> + <i>January 06, 2023</i> + </div> + <div id='copyright'> + © 2023 senders dot io - <a rel="license external noopener noreferrer" + target="_blank" + href="https://creativecommons.org/licenses/by/4.0/">CC BY-SA 4.0</a> + unless otherwise noted. + </div> + </div> +</body> +</html> |