# Web Content Extraction Reference This document provides detailed information about extracting content from HTML, YouTube, EPUB, and other web-based formats. ## HTML Conversion Convert HTML files and web pages to clean Markdown format. ### Basic HTML Conversion ```python from markitdown import MarkItDown md = MarkItDown() result = md.convert("webpage.html") print(result.text_content) ``` ### HTML Processing Features **What's preserved:** - Headings (`