This is a ByteDance PDF parsing tool that converts PDFs and scanned documents into structured Markdown or plain text, with support for tables and multi-column layouts. It's built on the Doubao service and handles OCR automatically. The workflow is straightforward: upload your PDF to TOS storage, set your parsing parameters (normal or detailed mode, page ranges, output paths), get a price estimate, then run the extraction. It can process up to 400 pages and outputs results with images and JSON metadata. The documentation stresses avoiding credential leaks and following the steps in strict order. If you're working within the ByteDance ecosystem and need reliable document extraction with the cost visible before you run it, this tool fits that need.
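The parameter-setting step described above can be sketched as a small pre-flight check run before requesting a price estimate. This is a minimal sketch, not the skill's actual interface: `ParseRequest`, `validate`, the field names, and the `tos://` path format are all hypothetical. Only the two modes (normal, detailed) and the 400-page cap come from the description, and applying the cap to the selected page range is itself an assumption.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

MAX_PAGES = 400                   # page cap stated in the description
MODES = ("normal", "detailed")    # the two parsing modes it mentions


@dataclass(frozen=True)
class ParseRequest:
    """Hypothetical parameter container; not the skill's real schema."""
    tos_path: str                 # uploaded PDF location (path format assumed)
    output_path: str              # where results should be written
    mode: str = "normal"
    page_range: Optional[Tuple[int, int]] = None  # 1-based inclusive; None = all


def validate(req: ParseRequest, total_pages: int) -> Tuple[int, int]:
    """Return the effective (start, end) page range or raise ValueError."""
    if req.mode not in MODES:
        raise ValueError(f"mode must be one of {MODES}, got {req.mode!r}")
    if total_pages < 1:
        raise ValueError("document has no pages")
    start, end = req.page_range or (1, total_pages)
    if not (1 <= start <= end <= total_pages):
        raise ValueError(f"page range {start}-{end} outside 1-{total_pages}")
    # Assumption: the cap applies to the number of pages selected for parsing.
    if end - start + 1 > MAX_PAGES:
        raise ValueError(f"at most {MAX_PAGES} pages can be parsed per run")
    return start, end
```

Checking locally like this means an out-of-range or oversized request fails before any estimate or extraction is requested; the real skill may enforce these limits differently.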
npx -y skills add bytedance/agentkit-samples --skill byted-las-pdf-parse-doubao --agent claude-code

Installs into .claude/skills of the current project.