Skip to content

Research at St Andrews

On the application of convex transforms to metric search

Research output: Contribution to journalArticlepeer-review

Open Access Status

  • Embargoed (until 8/08/21)

Author(s)

Richard Connor, Alan Dearle, Vladimir Mic, Pavel Zezula

School/Research organisations

Abstract

Scalable similarity search in metric spaces relies on using the mathematical properties of the space in order to allow efficient querying. Most important in this context is the triangle inequality property, which can allow the majority of individual similarity comparisons to be avoided for a given query.

However many important metric spaces, typically those with high dimensionality, are not amenable to such techniques. In the past convex transforms have been studied as a pragmatic mechanism which can overcome this effect; however the problem with this approach is that the metric properties may be lost, leading to loss of accuracy.

Here, we study the underlying properties of such transforms and their effect on metric indexing mechanisms. We show there are some spaces where certain transforms may be applied without loss of accuracy, and further spaces where we can understand the engineering tradeoffs between accuracy and efficiency. We back these observations with experimental analysis. To highlight the value of the approach, we show three large spaces deriving from practical domains whose dimensionality prevents normal indexing techniques, but where the transforms applied give scalable access with a relatively small loss of accuracy.

Close

Details

Original languageEnglish
Pages (from-to)563-570
JournalPattern Recognition Letters
Volume138
Early online date8 Aug 2020
DOIs
Publication statusPublished - Oct 2020

    Research areas

  • Metric search, Contex transform, Metric space

Discover related content
Find related publications, people, projects and more using interactive charts.

View graph of relations

Related by author

  1. BitPart: exact metric search in high(er) dimensions

    Dearle, A. & Connor, R., Jan 2021, In: Information Systems. 95, 14 p., 101493.

    Research output: Contribution to journalArticlepeer-review

  2. Sampled angles in high-dimensional spaces

    Connor, R. & Dearle, A., 2020, Similarity Search and Applications: 13th International Conference, SISAP 2020, Copenhagen, Denmark, September 30–October 2, 2020, Proceedings. Satoh, S., Vadicamo, L., Zimek, A., Carrara, F., Bartolini, I., Aumüller, M., Jónsson, B. Þ. & Pagh, R. (eds.). Cham: Springer, p. 233-247 (Lecture Notes in Computer Science (Information Systems and Applications, incl. Internet/Web, and HCI); vol. 12440).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  3. Modelling string structure in vector spaces

    Connor, R., Dearle, A. & Vadicamo, L., 9 Jul 2019, Proceedings of the 27th Italian Symposium on Advanced Database Systems: Castiglione della Pescaia (Grosseto), Italy, June 16th to 19th, 2019. Mecella, M., Amato, G. & Gennaro, C. (eds.). Sun SITE Central Europe, 12 p. 45. (CEUR Workshop Proceedings; vol. 2400).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  4. Querying metric spaces with bit operations

    Connor, R. & Dearle, A., 2018, Similarity Search and Applications: 11th International Conference, SISAP 2018, Lima, Peru, October 7-9, 2018, Proceedings. Marchand-Maillet, S., Silva, Y. N. & Chávez, E. (eds.). Cham: Springer, p. 33-46 14 p. (Lecture Notes in Computer Science; vol. 11223).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  5. Mediated Information Flow

    Dearle, A. & Connor, R., 17 Mar 2005, IPC No. G06F15/16, Patent No. US2005060380

    Research output: Patent

Related by journal

  1. The Parzen Window method: in terms of two vectors and one matrix

    Mussa, H. Y., Mitchell, J. B. O. & Afzal, A., 1 Oct 2015, In: Pattern Recognition Letters. 63, p. 30-35

    Research output: Contribution to journalArticlepeer-review

ID: 269563545

Top