SkillHub

pls-url-to-markdown

v1.0.0

Fetch URLs and convert web pages to clean Markdown for AI processing or knowledge bases. Use when: (1) Researching web content, (2) Building knowledge bases, (3) Extracting articles, (4) Converting pages for docs.

Sourced from ClawHub, Authored by Matt Valenta

Installation

Please help me install the skill `pls-url-to-markdown` from SkillHub official store. npx skills add mattvalenta/pls-url-to-markdown

URL to Markdown Converter

Fetches URLs and converts web pages to clean Markdown.

Quick Start

Python Method (markdownify)

pip install requests beautifulsoup4 markdownify

python3 -c "... fetching and converting URL ..."

CLI Tools (html2text, pandoc)

curl -s URL | html2text
wget -q -O - URL | pandoc -f html -t markdown

Full Extraction Script

import requests
from bs4 import BeautifulSoup
from markdownify import markdownify as md

def url_to_markdown(url, output_file=None):
    # ... fetch, parse, convert logic ...
    pass

Content Extraction Patterns

Extract Article Body

def extract_article(html):
    soup = BeautifulSoup(html, 'html.parser')
    article = soup.find('article') or soup.find('main')
    return md(str(article)) if article else None

Preserve Code Blocks

def preserve_code(html):
    # ... logic to wrap code in ``` ...
    pass

CLI Usage

python url_to_markdown.py URL -o output.md

Error Handling

def safe_fetch(url, retries=3):
    # ... retry logic ...
    pass