Package: rtika 3.2.3
rtika: R Interface to 'Apache Tika'
Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
Authors:
rtika_3.2.3.tar.gz
rtika_3.2.3.zip(r-4.6)rtika_3.2.3.zip(r-4.5)rtika_3.2.3.zip(r-4.4)
rtika_3.2.3.tgz(r-4.5-any)rtika_3.2.3.tgz(r-4.4-any)
rtika_3.2.3.tar.gz(r-4.6-any)rtika_3.2.3.tar.gz(r-4.5-any)
rtika_3.2.3.tgz(r-4.5-emscripten)
rtika.pdf |rtika.html✨
rtika/json (API)
NEWS
| # Install 'rtika' in R: |
| install.packages('rtika', repos = c('https://packages.ropensci.org', 'https://cloud.r-project.org')) |
Reviews:rOpenSci Software Review #191
Bug tracker:https://github.com/ropensci/rtika/issues
Pkgdown/docs site:https://docs.ropensci.org
extract-metadataextract-textjavaparsepdf-filespeer-reviewedtesseracttika
Last updated from:350fcc3e90 (on master). Checks:10 OK. Indexed: yes.
| Target | Result | Total time | Artifact |
|---|---|---|---|
| linux-devel-x86_64 | OK | 111 | |
| pkgdown docs | OK | 148 | |
| source / vignettes | OK | 256 | |
| linux-release-x86_64 | OK | 134 | |
| macos-release-arm64 | OK | 91 | |
| macos-oldrel-arm64 | OK | 127 | |
| windows-devel | OK | 87 | |
| windows-release | OK | 94 | |
| windows-oldrel | OK | 101 | |
| wasm-release | OK | 132 |
Exports:install_tikajavatikatika_checktika_fetchtika_htmltika_jartika_jsontika_json_texttika_texttika_xml
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Install or Update the Apache Tika 'jar' | install_tika |
| System Command to Run Java | java |
| Main R Interface to 'Apache Tika' | tika |
| Check Tika against a checksum | tika_check |
| Fetch Files with the Content-Type Preserved in the File Extension | tika_fetch |
| Get Structured XHTML | tika_html |
| Path to Apache Tika | tika_jar |
| Get json Metadata and XHTML Content | tika_json |
| Get json Metadata and Plain Text Content | tika_json_text |
| Get Plain Text | tika_text |
| Get a Structured XHTML Rendition | tika_xml |
