Opened 20 years ago

Closed 20 years ago

Last modified 20 years ago

#61 closed defect (released)

Unknown Exception from scriptindex when indexing a very long document

Reported by: Arjen Owned by: Olly Betts
Priority: high Milestone:
Component: Omega Version: 0.8.4
Severity: critical Keywords:
Cc: Blocked By:
Blocking: Operating System: Linux

Description

When testing something else I noticed that scriptindex didn't index a specific batch of documents, outputted the message "Unknown Exception" and stopped that batch.

I had a suspicion about a certain document in that set and seperated it from the batch. When I tried to create a new database with only that document, I got the same exception. I'm not sure whether its the documents length (its the longest we have, about 5.5MB in plain text) or its content.

Since the document is not entirely free for me to attach along, I've e-mailed a bzip2ed-version of it to Olly.

Change History (2)

comment:1 by Olly Betts, 20 years ago

Resolution: fixed
Status: newclosed

The problem is trying to "unhtml" text which contains "</body>".

Fixed in CVS HEAD.

comment:2 by Olly Betts, 20 years ago

Operating System: Linux
Resolution: fixedreleased

Fixed in 0.8.5.

Note: See TracTickets for help on using tickets.