From 816893e009e69bb6a13994fbf35197489916c234 Mon Sep 17 00:00:00 2001 From: zYne Date: Thu, 12 Jul 2007 11:50:54 +0000 Subject: [PATCH] --- manual/new/docs/en.txt | 1 + manual/new/docs/en/searching.txt | 63 ++++++++++++++++++++++++++++++++ 2 files changed, 64 insertions(+) create mode 100644 manual/new/docs/en/searching.txt diff --git a/manual/new/docs/en.txt b/manual/new/docs/en.txt index f8ab32d1c..b19d0cbbd 100644 --- a/manual/new/docs/en.txt +++ b/manual/new/docs/en.txt @@ -10,6 +10,7 @@ + Caching + Event listeners + Plugins ++ Searching + Database abstraction + Improving Performance + Technology diff --git a/manual/new/docs/en/searching.txt b/manual/new/docs/en/searching.txt new file mode 100644 index 000000000..b57511842 --- /dev/null +++ b/manual/new/docs/en/searching.txt @@ -0,0 +1,63 @@ +++ Introduction + +Searching is a huge topic, hence an entire chapter has been devoted to a plugin called Doctrine_Search. Doctrine_Search is a fulltext indexing and searching tool similar to Apache Lucene. + +Consider we have a class called NewsItem with the following definition: + + +class NewsItem extends Doctrine_Record +{ + public function setTableDefinition() + { + $this->hasColumn('title', 'string', 200); + $this->hasColumn('content', 'string'); + } +} + + +Now lets say we have an application where users are allowed to search for different news items, an obvious way to implement this would be building a form and based on that form build DQL queries such as: + +SELECT n.* FROM NewsItem n WHERE n.title LIKE ? OR n.content LIKE ? + + +However soon as the application grows these kind of queries become very slow. For example when using the previous query with parameters '%framework%' and '%framework%' (this would be equivalent of 'find all news items whose title or content contains word 'framework') the database would have to traverse trhough each row in the table, which would naturally be very very slow. + +Doctrine solves this with its search component and inverse indexes. First lets alter our definition a bit: + + +class NewsItem extends Doctrine_Record +{ + public function setTableDefinition() + { + $this->hasColumn('title', 'string', 200); + $this->hasColumn('content', 'string'); + } + public function setUp() + { + $this->loadTemplate('Doctrine_Search_Template', + array('fields' => array('title', 'content'))); + } +} + + +Here we tell Doctrine that NewsItem class uses Doctrine_Search_Template and fields title and content are marked as fulltext indexed fields. This means that everytime a NewsItem is added or updated Doctrine will: + +1. Update the inverse search index or +2. Add new pending entry to the inverse search index (its efficient to update the inverse search index in batches) + +++ Index structure + +The structure of the inverse index Doctrine uses is the following: + +[ (string) keyword] [ (string) field ] [ (integer) position ] [ (mixed) [foreign_keys] ] + +* **keyword** is the keyword in the text that can be searched for +* **field** is the field where the keyword was found +* **position** is the position where the keyword was found +* **[foreign_keys]** either one or multiple fields depending on the owner component (here NewsItem) + +In the NewsItem example the [foreign_keys] would simple contain one field newsitem_id with foreign key references to NewsItem(id) and with onDelete => CASCADE constraint. + +++ Index building + +