Ticket #114 (assigned enhancement)
Use libmagic or libextractor instead of own MIME mappings and extractions
| Reported by: | nemesis | Owned by: | olly |
|---|---|---|---|
| Priority: | normal | Milestone: | 1.1.1 |
| Component: | Omega | Version: | SVN trunk |
| Severity: | minor | Keywords: | |
| Cc: | Blocked By: | ||
| Operating System: | All | Blocking: |
Description (last modified by olly) (diff)
Hello,
I locally first modified omindex to use libmagic's MIME database, instead of hard coding the MIME type to file extension mapping. This ensures that the internally used MIME types are more consistent with accepted standard types.
Then I went further and instead of using file extensions to determine type, used libmagic to fingerprint the files. This is slower, but ensures that the file actually is identified correctly even if the extension is wrong.
Now I am using libextractor to actually extract the metadata from the file, instead of calling these external programs inside omindex based on the MIME type. Using libextractor greatly simplifies omindex.
Is anyone interested in these modifications?
