Joomla 4 Smart search: so disk space consuming? (Topic is solved)

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Tue Jun 27, 2023 6:49 am

Hello !
I've just migrated from Joomla 3 to Joomla 4. I read all the articles about Smart Search and deleted the old Joomla 3 search package. I followed the steps for indexing the content with the Smart Search component, and suddenly my database exceeded its limit because it grew by 97 Mb after indexing a little more than 300 articles...
Is there a way to dodge this or am I "condemned" to buy more disk space from my provider?
Because of course, without indexing the contents, the search box does not return any results...
Thanks for your answer(s)!
Fabrice

SharkyKZ
Joomla! Virtuoso
Posts: 3133
Joined: Fri Jul 05, 2013 10:35 am
Location: Parts Unknown

Re: Joomla 4 Smart search: so disk space consuming?

Post by SharkyKZ » Tue Jun 27, 2023 7:38 am

Did you try optimizing the tables? There is an Optimise button in the component. Do you have the "Search for Phrases" indexing option enabled? And can you clarify if you mean 97 Mb or MB? 97 Mb (about 12 MB) sounds possible for 300 articles.
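
The "97 Mb is about 12 MB" figure is simply a division by eight, since there are eight bits in a byte. A minimal Python sketch of that arithmetic (the function name is only an illustrative choice):

```python
BITS_PER_BYTE = 8

def megabits_to_megabytes(megabits: float) -> float:
    """Convert a size in megabits (Mb) to megabytes (MB)."""
    return megabits / BITS_PER_BYTE

print(megabits_to_megabytes(97))  # 12.125
```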

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Tue Jun 27, 2023 9:44 pm

Hello SharkyKZ and thanks a lot for your quick answer! :)
Indeed, you were right. Optimizing the tables did the trick. And yes, the first time I ran the indexer, the finder_links_terms table weighed about 97 Mb (I guess something must have gone wrong anyway)! Now it is only 16 Mb, which is much lighter, while the finder_terms table is 22 Mb.
Now I have enabled the "Search for Phrases" indexing option, and everything seems to be plain sailing. :pop
Thanks so much for your advice.
Regards
Fabrice
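
For anyone who wants to check the same figures outside phpMyAdmin, here is a rough Python sketch. It assumes a MySQL or MariaDB database, where `information_schema.tables` reports `data_length` and `index_length` in bytes; the `jos_` table prefix, the helper name and the sample byte counts are made up for illustration, and the query has to be run with whatever database client you already use.

```python
# Assumption: MySQL/MariaDB. Run this query with your own client and
# pass the resulting rows to summarise().
FINDER_SIZE_QUERY = """
SELECT table_name, data_length, index_length
FROM information_schema.tables
WHERE table_schema = DATABASE()
  AND table_name LIKE '%finder%'
"""

MIB = 1024 ** 2  # bytes in one mebibyte

def summarise(rows):
    """Return (table_name, size_in_MiB) pairs, largest first.

    rows: iterable of (table_name, data_length, index_length), in bytes.
    """
    sizes = [(name, round((data + index) / MIB, 1)) for name, data, index in rows]
    return sorted(sizes, key=lambda pair: pair[1], reverse=True)

# Made-up byte counts and a hypothetical "jos_" prefix, only to show the output shape.
sample = [
    ("jos_finder_links_terms", 12 * MIB, 4 * MIB),
    ("jos_finder_terms", 20 * MIB, 2 * MIB),
]
print(summarise(sample))
```

Comparing the numbers before and after pressing the Optimise button shows how much of the size was reclaimable overhead rather than index data.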

toivo
Joomla! Master
Posts: 17983
Joined: Thu Feb 15, 2007 5:48 am
Location: Sydney, Australia

Re: Joomla 4 Smart search: so disk space consuming?

Post by toivo » Tue Jun 27, 2023 9:49 pm

SharkyKZ wrote: Tue Jun 27, 2023 7:38 am
And can you clarify if you mean 97 Mb or MB?

@fabio42, as requested by @SharkyKZ, can you please clarify this? There is a big difference between bits and bytes.
Toivo Talikka, Global Moderator

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Tue Jun 27, 2023 9:56 pm

Seems like I'm discovering lots of things right here! Actually I must have been wrong again, not being tech-savvy.
I checked and it's all about Mib so Mebibytes then. Am I right now ?

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Tue Jun 27, 2023 10:07 pm

Well, here I am again. Without doing anything, the tables have grown again... no idea why.
The whole finder thing now weighs more than 100 Mib. :eek:
Image

Is it normal?
Lost me :(

toivo
Joomla! Master
Posts: 17983
Joined: Thu Feb 15, 2007 5:48 am
Location: Sydney, Australia

Re: Joomla 4 Smart search: so disk space consuming?

Post by toivo » Tue Jun 27, 2023 11:27 pm

fabio42 wrote: Tue Jun 27, 2023 9:56 pm
I checked and it's all about Mib so Mebibytes then. Am I right now ?

The abbreviation for mebibyte is MiB. There are mebibits, too: Mibit.

This area can be confusing, even for those of us who started programming in the early 1970s in octal and hex code, with just a few KB to play with :D

Ref. 1 Multiple-byte units
Ref. 2 Multiple-bit units
Toivo Talikka, Global Moderator
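
To make the difference between these units concrete, here is a small Python sketch. The constants follow the standard definitions (1 MB = 1000^2 bytes, 1 MiB = 1024^2 bytes, 1 Mibit = 1024^2 bits); the function names are invented for this example.

```python
MB = 1000 ** 2          # megabyte (decimal prefix), in bytes
MIB = 1024 ** 2         # mebibyte (binary prefix), in bytes
MIBIT = 1024 ** 2 // 8  # mebibit, expressed in bytes

def mib_to_mb(mib: float) -> float:
    """Convert mebibytes to decimal megabytes."""
    return mib * MIB / MB

def mibit_to_mib(mibit: float) -> float:
    """Convert mebibits to mebibytes."""
    return mibit * MIBIT / MIB

print(mib_to_mb(100))     # 104.8576
print(mibit_to_mib(100))  # 12.5
```

So a table reported as 100 MiB is a little under 105 MB, whereas 100 Mibit would be only 12.5 MiB.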

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Thu Jun 29, 2023 6:21 am

Still stuck with more than 160 MiB in the finder tables. I'm clueless. Any ideas?

Nidzo2203
Joomla! Explorer
Posts: 353
Joined: Sat Nov 21, 2009 4:52 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by Nidzo2203 » Thu Jun 29, 2023 8:40 pm

Don't use Smart search on big sites.

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Sun Jul 02, 2023 2:45 pm

I'm not sure it's possible to disable Smart Search. If I don't index my contents, then, the search result box displays "no results found" for any word search.
There's no alternative for the smart search component actually. The old search component will be removed in the next versions, so not using smart search is not a satisfying solution...
I'm looking for a reasonable solution that will help me not consume so much disk space. Perhaps that has to do with some tweaking in the component but the options seem limited, or else I'm missing something.

Nidzo2203
Joomla! Explorer
Posts: 353
Joined: Sat Nov 21, 2009 4:52 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by Nidzo2203 » Sun Jul 02, 2023 6:19 pm

I use Google Custom Search for a site with over 20,000 articles.

Joomla Smart Search has been data-hungry ever since it was introduced.

Check this Joomla documentation https://docs.joomla.org/Smart_Search_on_large_sites

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Mon Jul 03, 2023 7:06 am

Thanks for this documentation which explains clearly what smart search implies. I will try out Google custom search, which might be a workaround! (which is quite disappointing for a component presented as a revolution)

pe7er
Joomla! Master
Posts: 25345
Joined: Thu Aug 18, 2005 8:55 pm
Location: Nijmegen, Netherlands

Re: Joomla 4 Smart search: so disk space consuming?

Post by pe7er » Mon Jul 03, 2023 8:26 am

I have worked on large Joomla websites where we used external services like Elasticsearch for indexing.

There are 3rd party extensions available that integrate Elasticsearch into your Joomla website. In that case the search index data is not stored in your website but on an Elasticsearch server. Elasticsearch stores its data in files using key/value, which is much faster than using a database.

Your visitors still use Smart Search for searching. The Smart Search component will look up the data on the Elasticsearch server.
Kind Regards,
Peter Martin, Global Moderator
Company website: https://db8.nl/en/ - Joomla specialist, Nijmegen, Netherlands
The best website: https://the-best-website.com

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Mon Jul 03, 2023 5:34 pm

Sounds really interesting and worth trying! Thanks @pe7er!

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Sun Oct 29, 2023 10:39 am

Hey there!
Coming back after some time, and after trying to see whether there was any alternative to the Smart Search plugin. Alas, after investigating the different options (Elasticsearch, Google Search), I must say that even if they sound interesting, they remain paid options.
I just came to wonder: given that it is openly admitted that the Smart Search included in Joomla 4 is really disk-consuming for large websites (https://docs.joomla.org/Smart_Search_on_large_sites), though my website only contains 533 articles (is that supposed to be huge?), why on earth is nothing done to improve the plugin? (To my mind, Joomla does not benefit from Smart Search on this point.) The article linked above mentions Joomla Solr as the solution for "larger websites", but the extension has been unpublished!
So since the advent of Joomla 4, users are left with no other solution but to either disable it :( or find a paid replacement for it. I'm not sure whether the "old" search plugin is still working... but if it is, for how long? (I had once read it was going to be deprecated, as I said in my previous posts, so it was recommended to remove it on a Joomla 4 install, which I sadly did... but perhaps I can try to install it again?)
Well, as you can see, there's not much we can do about it for now but wait, as the article says that this point will be addressed in future versions of Smart Search. Can't wait for that.
Meanwhile, which setting(s) would you advise me to tweak in the Smart Search parameters to shrink my finder_links table (which is 100 MiB for the moment)?

Thanks for your help. Any advice is useful for now.
Fabrice

DV-admin
Joomla! Enthusiast
Posts: 171
Joined: Wed Sep 20, 2017 7:24 am
Location: Genova

Re: Joomla 4 Smart search: so disk space consuming?

Post by DV-admin » Sun Oct 29, 2023 5:29 pm

I have some experience with this because I have a much bigger site. I turned off the search phrase and keyword options, as they created too many results. Do not cache the search results either if you have a very busy website. I introduced Jfilters a while back and don't cache the results. This basically works pretty well. You could also consider sitesearch360; it works with Elasticsearch, which is useful if you can pay for it.

Nidzo2203
Joomla! Explorer
Posts: 353
Joined: Sat Nov 21, 2009 4:52 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by Nidzo2203 » Sun Oct 29, 2023 8:37 pm

fabio42 wrote: Sun Oct 29, 2023 10:39 am
Hey there!
Coming back after some time, and after trying to see whether there was any alternative to the Smart Search plugin. Alas, after investigating the different options (Elasticsearch, Google Search), I must say that even if they sound interesting, they remain paid options.
Thanks for your help. Any advice is useful for now.
Fabrice

Google Custom Search is free.

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Sun Nov 26, 2023 1:58 pm

Thank you @DV-admin for your answer. I did what you suggested but unfortunately, the finder_links table is still 100 MiB, even after optimizing Smart Search. That means I'll have to explore the Google Custom Search option, though I'm not a fan of Google's intrusive tools. Better than nothing anyway! (thanks @Nidzo2203!)

factum
Joomla! Apprentice
Posts: 29
Joined: Mon Sep 27, 2021 9:35 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by factum » Mon Nov 27, 2023 12:30 pm

Having a reliable search engine requires text parsing and indexing.
This has to be done for any information passed (i.e. title, text, custom fields, categories, tags, meta-tags).
It is not an easy task and requires a lot of data to be reliable.
So there is a correlation between accuracy and data size.

You have some options in the Smart Search config > Index tab. Do not forget to re-index your content if you change that.
Also you can define the Content Maps in the Smart Search.
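
To illustrate the link between index detail and data size described above, here is a toy inverted index in Python. It is not Joomla's actual indexer: the function names, and the idea that phrase indexing stores every run of up to N consecutive words, are assumptions made for the sketch. It counts the (term, document) pairs that a links-to-terms style table would have to store.

```python
from collections import defaultdict

def build_index(docs, max_phrase_len=1):
    """Map each term (or phrase of up to max_phrase_len words)
    to the ids of the documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        words = text.lower().split()
        for n in range(1, max_phrase_len + 1):
            for i in range(len(words) - n + 1):
                index[" ".join(words[i:i + n])].add(doc_id)
    return index

def link_rows(index):
    """Count (term, document) pairs, i.e. rows in a link table."""
    return sum(len(doc_ids) for doc_ids in index.values())

# Two made-up "articles", only for illustration.
docs = {
    1: "smart search indexes every article",
    2: "every article adds rows to the index",
}
words_only = build_index(docs, max_phrase_len=1)
with_phrases = build_index(docs, max_phrase_len=2)
print(link_rows(words_only), link_rows(with_phrases))  # 12 22
```

On these two short sentences the pair count goes from 12 to 22 once two-word phrases are indexed as well, which hints at why options such as "Search for Phrases" can inflate the finder tables.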

fabio42
Joomla! Enthusiast
Posts: 152
Joined: Tue Aug 20, 2019 2:39 pm

Re: Joomla 4 Smart search: so disk space consuming?

Post by fabio42 » Mon Nov 27, 2023 1:29 pm

@factum
Thanks for your answer. Yet the parameters do not seem to influence the size of the database; they influence the efficiency of the search itself instead. The documentation mentions one parameter called "Memory table limit", but this parameter is absent from the latest version of Joomla 4.
As for the Content Maps, could you elaborate, please?
