Useful string utility functions for Go projects. Either because they are faster than the common Go version or do not exist in the standard library.
You can find all details here https://pkg.go.dev/github.com/boyter/go-string
Probably the most useful methods are IndexAll and IndexAllIgnoreCase which for string literal searches should be drop in replacements for regexp.FindAllIndex while totally avoiding the regular expression engine and as such being much faster.
Some quick benchmarks using a simple program which opens a 550MB file and searches over it in memory. Each search is done three times, the first using regexp.FindAllIndex and the second using IndexAllIgnoreCase.
For this specific example the wall clock time to run is at least 10x less, but with the same matching results.
$ ./csperf ſecret 550MB
File length 576683100
FindAllIndex (regex ignore case)
Scan took 25.403231773s 16680
Scan took 25.39742299s 16680
Scan took 25.227218738s 16680
IndexAllIgnoreCase (custom)
Scan took 2.04013314s 16680
Scan took 2.019360935s 16680
Scan took 1.996732171s 16680
The above example in code for you to copy
// Simple test comparison between various search methods
func main() {
arg1 := os.Args[1]
arg2 := os.Args[2]
b, err := os.ReadFile(arg2)
if err != nil {
fmt.Print(err)
return
}
fmt.Println("File length", len(b))
haystack := string(b)
var start time.Time
var elapsed time.Duration
fmt.Println("\nFindAllIndex (regex)")
r := regexp.MustCompile(regexp.QuoteMeta(arg1))
for i := 0; i < 3; i++ {
start = time.Now()
all := r.FindAllIndex(b, -1)
elapsed = time.Since(start)
fmt.Println("Scan took", elapsed, len(all))
}
fmt.Println("\nIndexAll (custom)")
for i := 0; i < 3; i++ {
start = time.Now()
all := str.IndexAll(haystack, arg1, -1)
elapsed = time.Since(start)
fmt.Println("Scan took", elapsed, len(all))
}
r = regexp.MustCompile(`(?i)` + regexp.QuoteMeta(arg1))
fmt.Println("\nFindAllIndex (regex ignore case)")
for i := 0; i < 3; i++ {
start = time.Now()
all := r.FindAllIndex(b, -1)
elapsed = time.Since(start)
fmt.Println("Scan took", elapsed, len(all))
}
fmt.Println("\nIndexAllIgnoreCase (custom)")
for i := 0; i < 3; i++ {
start = time.Now()
all := str.IndexAllIgnoreCase(haystack, arg1, -1)
elapsed = time.Since(start)
fmt.Println("Scan took", elapsed, len(all))
}
}
Note that it performs best with real documents and wost when searching over random data. Depending on what you are searching you may have a similar speed up or a marginal one.
FindAllIndex has a similar speed up,
// BenchmarkFindAllIndex-8 2458844 480.0 ns/op
// BenchmarkIndexAll-8 14819680 79.6 ns/op
See the benchmarks for full proof where they test various edge cases.
The other most useful method is HighlightString. HighlightString takes in some content and locations and then inserts in/out
strings which can be used for highlighting around matching terms. For example you could pass in "test" and have it return "<strong>te</strong>st".
The argument locations accepts output from regexp.FindAllIndex or the included IndexAllIgnoreCase or IndexAll.
All code is licenced under the MIT licence.
This project includes SIMD assembly code (indexbyte2_amd64.s, indexbyte2_arm64.s and
associated Go files) adapted from fzf by Junegunn Choi,
used under the MIT License:
The MIT License (MIT)
Copyright (c) 2013-2026 Junegunn Choi
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in
all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN
THE SOFTWARE.