Package: rtika 2.7.0
rtika: R Interface to 'Apache Tika'
Extract text or metadata from over a thousand file types, using Apache Tika <https://tika.apache.org/>. Get either plain text or structured XHTML content.
Authors:
rtika_2.7.0.tar.gz
rtika_2.7.0.zip (r-4.6) | rtika_2.7.0.zip (r-4.5) | rtika_2.7.0.zip (r-4.4)
rtika_2.7.0.tgz (r-4.5-any) | rtika_2.7.0.tgz (r-4.4-any)
rtika_2.7.0.tar.gz (r-4.6-any) | rtika_2.7.0.tar.gz (r-4.5-any)
rtika_2.7.0.tgz (r-4.4-emscripten)
rtika.pdf | rtika.html
rtika/json (API)
NEWS
# Install 'rtika' in R:
install.packages('rtika', repos = c('https://ropensci.r-universe.dev', 'https://cloud.r-project.org'))
Reviews: rOpenSci Software Review #191
Bug tracker: https://github.com/ropensci/rtika/issues
Pkgdown site: https://docs.ropensci.org
Topics: extract-metadata, extract-text, java, parse, pdf-files, peer-reviewed, tesseract, tika
Last updated 2 years ago from: 64f4be7c75 (on master). Checks: 10 OK. Indexed: yes.
Target | Result | Total time |
---|---|---|
source / vignettes | OK | 171 |
pkgdown docs | OK | 280 |
linux-devel-x86_64 | OK | 121 |
linux-release-x86_64 | OK | 149 |
macos-release-arm64 | OK | 68 |
macos-oldrel-arm64 | OK | 105 |
windows-devel | OK | 75 |
windows-release | OK | 100 |
windows-oldrel | OK | 161 |
wasm-release | OK | 107 |
Exports: install_tika, java, tika, tika_check, tika_fetch, tika_html, tika_jar, tika_json, tika_json_text, tika_text, tika_xml
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Install or Update the Apache Tika 'jar' | install_tika |
System Command to Run Java | java |
rtika: R Interface to 'Apache Tika' | rtika |
Main R Interface to 'Apache Tika' | tika |
Check Tika against a checksum | tika_check |
Fetch Files with the Content-Type Preserved in the File Extension | tika_fetch |
Get Structured XHTML | tika_html |
Path to Apache Tika | tika_jar |
Get json Metadata and XHTML Content | tika_json |
Get json Metadata and Plain Text Content | tika_json_text |
Get Plain Text | tika_text |
Get a Structured XHTML Rendition | tika_xml |