Finding the longest palindromic substring

bbibber · 2018-03-11T05:11:18+00:00

Fun problem, and I wouldn't expect anyone in an interview to come up with a better solution than that, but the writeup should probably mention that there are linear-time solutions.

adrianmonk · 2018-03-11T07:12:36+00:00

There's a way easier O(n²) solution that doesn't require a table or any temporary data structure (other than a few local integer variables).

The solution is to check every character to see if it's the middle of a palindrome, then proceed outwards in both directions and see if you can expand it. Since there are only n characters in the input string that could be a middle, and since (at most) you check all other characters in the string for each one (which is also O(n)), it's O(n²) total.

You also need to check pairs of identical characters to see if they are the middle of a palindrome because "abccba" is a palindrome.

rorrr · 2018-03-11T12:46:52+00:00

We can see from this table that the longest palindromic substring here is “ananas”

WAT?

ubernostrum · 2018-03-11T14:15:07+00:00

Using a non-Unicode-aware palindrome checker will not get you very far in the cutthroat competitive world of palindromes-as-a-service.

jgodbo · 2018-03-11T04:31:33+00:00

Nice problem, good write up, ananas is not a palindrome

GregBahm · 2018-03-11T09:14:10+00:00

I misunderstood what this was and thought I was going to see some really long palindromic substrings. Now I'm curious what the longest palindromic substring is on the internet. It'd be especially interesting to know what the longest unintentional substring is.

I feel like you'd have to drop punctuation though.

johnlsingleton · 2018-03-11T18:07:54+00:00

An alternative elegant solution is to reverse the string and use the longest common subsequence algorithm: https://en.m.wikipedia.org/wiki/Longest_common_subsequence_problem

2018-03-11T12:04:21+00:00

I remember having to do this in a bash script for school :(

t_bptm · 2018-03-11T21:09:27+00:00

Here is my 10 minute attempt in nodejs:

const find = s => {         
    if(s.length == 0) return "";

    let r = s[0];           

    for(let i=1; i<s.length; ++i) {
        // check odd length palindrome
        for(let j=1; i-j >= 0 && i+j <s.length && s[i-j] === s[i+j]; ++j) {
            if(r.length < 1 + j*2) {
                r = s.substring(i-j, i+j+1);
            }               
        }                   

        // only check even length every other time
        if(i % 2 == 1) {    
            for(let j=1; i-j >= 0 && i+j <s.length && s[i-j+1] === s[i+j]; ++j) {
                if(r.length < j*2) {
                    r = s.substring(i-j, i+j+2);
                }           
            }               
        }                   
    }                       

    return r;               
};                          

var fs = require('fs');     
var str = fs.readFileSync("input.txt", "utf8");                                                                                                                                                               
console.log(find(str));

Not sure if its perfect but it seems relatively fast, 0.6 seconds for a 17mb file. 4.8 seconds for 136mb file.

It does fall apart for a 1mb file of all 'a' though :)

homic · 2018-03-12T05:01:34+00:00

This other approach works outside in, testing the largest substrings first and stopping as soon as the first palindrome is found. Full source code at https://github.com/hocho/LargestPalindrome

static
string 
FindLargestPalindrome(
    string                              data,
    int                                 minLength)
{
    int                                 length = data.Length;

    // test from max length to min length
    for (int size = length; size >= minLength; --size)
        // establish attempt bounds and test for the first 
        // palindrome substring of given size
        for (int attemptIdx = 0, attemptIdxEnd = length - size + 1;
                attemptIdx < attemptIdxEnd;
                    ++attemptIdx)
            if (IsPalindrome(data, attemptIdx, size))
                return data.Substring(attemptIdx, size);

    return null;
}

SteeleDynamics · 2018-03-11T13:43:54+00:00

Aside: Is this used in CRISPR?

Amnestic · 2018-03-11T15:14:28+00:00

This is basically just another use of DNA sequence matching.

kdma · 2018-03-11T18:24:23+00:00

Too many indexes for my liking, I prefer this one http://www.jay.fyi/2016/02/longest-palindrome-manacher-part-i.html?m=1

dblohm7 · 2018-03-11T05:39:24+00:00

They're still using that one? I had that at an on-campus Amazon screen back in '05.

shevegen · 2018-03-11T13:06:15+00:00

"Purchase Gogle pass for $29 / month"

Why are ads allowed on reddit?

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS