https://github.com/opendatalab/MinerU?tab=readme-ov-file#online-demo
https://github.com/opendatalab/MinerU
pdf格式转换:
(X:\CondaEnvs\MinerU) PS C:\Users\ZMG> magick --version
Version: ImageMagick 7.1.1-38 Q16 x64 b0ab922:20240901 https://imagemagick.org
Copyright: (C) 1999 ImageMagick Studio LLC
License: https://imagemagick.org/script/license.php
Features: Channel-masks(64-bit) Cipher DPC Modules OpenCL OpenMP(2.0)
Delegates (built-in): bzlib cairo flif freetype gslib heic jng jp2 jpeg jxl lcms lqr lzma openexr pangocairo png ps raqm raw rsvg tiff webp xml zip zlib
Compiler: Visual Studio 2022 (194134120)
conda create -n MinerU python=3.10
conda create --prefix X:\MinerU python=3.10
conda activate MinerU
pip install -U magic-pdf[full] detectron2 -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install -U magic-pdf[full] --extra-index-url https://wheels.myhloli.com
pip install magic-pdf[full-cpu] detectron2 --extra-index-url https://myhloli.github.io/wheels/^C
pip install -U magic-pdf[full] -i https://pypi.tuna.tsinghua.edu.cn/simple
启动命令:
magic-pdf -p {some_pdf} -o {some_output_dir} -m auto
### 激活虚拟环境:
MinerU * X:\CondaEnvs\MinerU
magic-pdf -p "X:\MinerU\demo1.pdf" -o "X:\MinerU\output" -m auto