Ticket #61 (closed defect: released)

Opened 4 years ago

Last modified 4 years ago

Unknown Exception from scriptindex when indexing a very long document

Reported by: arjen
Owned by: olly
Priority: high
Milestone:
Component: Omega
Version: 0.8.4
Severity: critical
Keywords:
Cc:
Blocked By:
Operating System: Linux
Blocking:

Description

While testing something else, I noticed that scriptindex failed to index a specific batch of documents: it printed the message "Unknown Exception" and stopped processing that batch.

I suspected a certain document in that set and separated it from the batch. When I tried to create a new database containing only that document, I got the same exception. I'm not sure whether the cause is the document's length (it's the longest we have, about 5.5MB in plain text) or its content.

Since I'm not free to attach the document here, I've e-mailed a bzip2-compressed copy of it to Olly.

Change History

Changed 4 years ago by olly

  • status changed from new to closed
  • resolution set to fixed

The exception occurs when scriptindex tries to "unhtml" text which contains "</body>".

Fixed in CVS HEAD.

Changed 4 years ago by olly

  • resolution changed from fixed to released

Fixed in 0.8.5.

Changed 4 years ago by trac

  • platform set to Linux