Package: rtika 3.2.3
rtika: R Interface to 'Apache Tika'
Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
Authors:
rtika_3.2.3.tar.gz
rtika_3.2.3.zip(r-4.6)rtika_3.2.3.zip(r-4.5)rtika_3.2.3.zip(r-4.4)
rtika_3.2.3.tgz(r-4.5-any)rtika_3.2.3.tgz(r-4.4-any)
rtika_3.2.3.tar.gz(r-4.6-any)rtika_3.2.3.tar.gz(r-4.5-any)
rtika_3.2.3.tgz(r-4.5-emscripten)
rtika.pdf |rtika.html✨
rtika/json (API)
NEWS
# Install 'rtika' in R: |
install.packages('rtika', repos = c('https://packages.ropensci.org', 'https://cloud.r-project.org')) |
Reviews:rOpenSci Software Review #191
Bug tracker:https://github.com/ropensci/rtika/issues
Pkgdown/docs site:https://docs.ropensci.org
extract-metadataextract-textjavaparsepdf-filespeer-reviewedtesseracttika
Last updated from:350fcc3e90 (on master). Checks:10 OK. Indexed: yes.
Target | Result | Total time | Artifact |
---|---|---|---|
linux-devel-x86_64 | OK | 135 | |
pkgdown docs | OK | 184 | |
source / vignettes | OK | 279 | |
linux-release-x86_64 | OK | 128 | |
macos-release-arm64 | OK | 76 | |
macos-oldrel-arm64 | OK | 101 | |
windows-devel | OK | 69 | |
windows-release | OK | 109 | |
windows-oldrel | OK | 107 | |
wasm-release | OK | 140 |
Exports:install_tikajavatikatika_checktika_fetchtika_htmltika_jartika_jsontika_json_texttika_texttika_xml
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Install or Update the Apache Tika 'jar' | install_tika |
System Command to Run Java | java |
Main R Interface to 'Apache Tika' | tika |
Check Tika against a checksum | tika_check |
Fetch Files with the Content-Type Preserved in the File Extension | tika_fetch |
Get Structured XHTML | tika_html |
Path to Apache Tika | tika_jar |
Get json Metadata and XHTML Content | tika_json |
Get json Metadata and Plain Text Content | tika_json_text |
Get Plain Text | tika_text |
Get a Structured XHTML Rendition | tika_xml |