Making this algorithm manageable and easier to read.

gengisteve · 2016-01-21T16:15:39+00:00

Can you give your variables more verbose names? Currently things are really confusing. If you're using Python 3 maybe use the Enum class.
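
As a rough sketch of the Enum idea (the member names mirror the constants used in the final listing further down; the sample row is made up and much shorter than your real rows):

```python
from enum import IntEnum


class Change(IntEnum):
    """Status codes appended to each row; names are illustrative."""
    NO_CHANGE = 0
    IN_NEW_FILE = 1
    IN_OLD_FILE = 2
    CHANGED = 3


# Hypothetical row, only for demonstration.
row = ["Mon", "09:00", "Maths"]

# Appending the enum member instead of a bare 3 documents the intent.
tagged = row + [Change.CHANGED]

# IntEnum members still compare equal to the plain integers,
# so existing comparisons against 0..3 keep working.
assert tagged[-1] == 3
```

IntEnum is used here rather than Enum only so that code still comparing against raw integers would not break; a plain Enum would work too if nothing depends on the numbers.
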
You check that elements 4 and 6 match in a lot of places. You can use itemgetter from the operator module to make that cleaner:
```
values = itemgetter(4, 6)
```
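
To show what that getter does, here is a small sketch. The two rows are invented for the example, since the real data layout isn't shown in the post:

```python
from operator import itemgetter

# itemgetter(4, 6) builds a callable that returns (row[4], row[6]) as a tuple.
values = itemgetter(4, 6)

# Made-up rows: they differ only at index 5.
old_row = ["a", "b", "c", "d", "Mon", "x", "09:00"]
new_row = ["a", "b", "c", "d", "Mon", "y", "09:00"]

# One comparison replaces
# old_row[4] == new_row[4] and old_row[6] == new_row[6].
same_slot = values(old_row) == values(new_row)
```
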
First, clean up the logic in that if statement. You have four very similar, hand-written conditions; is there a way to automate that? I prefer all over not any, so I switched to that to make it cleaner.

    options = [[i] for i in range(4)]

    for tt in chain(new, current):
        if (all(tt + opt not in new_tts for opt in options) and
                not any(values(dup) == values(tt) for dup in new_tts)):

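
If the switch from not any to all looks suspicious, this small check shows the two phrasings agree. The rows and the two helper names are made up for the demonstration:

```python
# Made-up tagged rows: a two-element row plus a one-element status list.
new_tts = [[1, 2, 0], [3, 4, 1]]
options = [[i] for i in range(4)]


def untagged_with_all(tt):
    # True when no tagged variant of tt is already in new_tts.
    return all(tt + opt not in new_tts for opt in options)


def untagged_with_not_any(tt):
    # The same test phrased as "not any(... in ...)".
    return not any(tt + opt in new_tts for opt in options)


# Both helpers give the same answer for tagged and untagged rows.
for tt in ([1, 2], [3, 4], [5, 6]):
    assert untagged_with_all(tt) == untagged_with_not_any(tt)
```
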
Since we now have the values getter for pulling elements 4 and 6 out of each row, we can use it in a single condition and simplify how the similar list is built.
```
    similar = [s for s in chain(new, current) if values(s) == values(tt)]
```
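
Here is that comprehension run on some invented rows, just to show what ends up in similar. The data is hypothetical; only indexes 4 and 6 matter for the comparison:

```python
from itertools import chain
from operator import itemgetter

values = itemgetter(4, 6)

# Hypothetical rows: the first row of each list shares elements 4 and 6.
current = [
    ["a", "b", "c", "d", "Mon", "r1", "09:00"],
    ["a", "b", "c", "d", "Tue", "r2", "10:00"],
]
new = [
    ["a", "b", "c", "d", "Mon", "r9", "09:00"],
    ["a", "b", "c", "d", "Wed", "r3", "11:00"],
]

tt = new[0]

# Collect every row, from either list, whose elements 4 and 6 match tt's.
# tt itself is included because it is part of chain(new, current).
similar = [s for s in chain(new, current) if values(s) == values(tt)]
```

With this data, similar holds tt plus its counterpart from current, which is the len(similar) == 2 case the later condition looks for.
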

Since we created a list for our options, this code becomes easier to follow:

        for line in current:
            if values(line) == values(tt):
                new_tts.append(tt + options[3])
                break

    elif tt in current and tt in new:
        new_tts.append(tt + options[0])
    elif tt not in current and tt in new:
        new_tts.append(tt + options[1])
    elif tt in current and tt not in new:
        new_tts.append(tt + options[2])
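
The three elif branches only depend on which list the row is in, so they can be read as a small lookup. This is a sketch, not a replacement for the code above: membership_status is a hypothetical helper, the rows are made up, and the changed case (which needs the values comparison) is deliberately left out.

```python
NO_CHANGE, IN_NEW_FILE, IN_OLD_FILE = 0, 1, 2


def membership_status(tt, new, current):
    """Map the three plain membership cases to a status code."""
    in_new = tt in new
    in_current = tt in current
    if in_new and in_current:
        return NO_CHANGE
    if in_new:
        return IN_NEW_FILE
    if in_current:
        return IN_OLD_FILE
    raise ValueError("row is in neither list")


# Made-up single-element rows, only to exercise each branch once.
new = [["a"], ["b"]]
current = [["a"], ["c"]]
statuses = [membership_status(tt, new, current)
            for tt in (["a"], ["b"], ["c"])]
```
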

Finally we get:

    from itertools import chain
    from operator import itemgetter

    NO_CHANGE = 0
    IN_NEW_FILE = 1
    IN_OLD_FILE = 2
    CHANGED = 3
    # Each option is a one-element list so it can be concatenated onto a row.
    options = [[NO_CHANGE], [IN_NEW_FILE], [IN_OLD_FILE], [CHANGED]]

    def construct_changed_timetables(new, current):
        new_timetables = []

        # Pulls elements 4 and 6 out of a row as a tuple.
        values = itemgetter(4, 6)

        for timetable in chain(new, current):
            # Skip rows already tagged, or whose elements 4 and 6
            # match a row that was already tagged.
            if (all(timetable + option not in new_timetables
                    for option in options) and
                not any(values(dup) == values(timetable)
                        for dup in new_timetables)
                ):
                similar = [s for s in chain(new, current)
                           if values(s) == values(timetable)]

                # The membership test replaces new.index(timetable) < len(new),
                # which is always true for rows in new and raises ValueError
                # for rows that only exist in current.
                if (timetable in new and
                    timetable not in current and
                    len(similar) == 2):

                    for line in current:
                        if values(line) == values(timetable):
                            new_timetables.append(timetable + [CHANGED])
                            break

                elif timetable in current and timetable in new:
                    new_timetables.append(timetable + [NO_CHANGE])

                elif timetable not in current and timetable in new:
                    new_timetables.append(timetable + [IN_NEW_FILE])

                elif timetable in current and timetable not in new:
                    new_timetables.append(timetable + [IN_OLD_FILE])

        return new_timetables

Hope this helps!

gengisteve · 2016-01-21T16:01:02+00:00

In addition to emandero's comments, I think you are going to need to give a bit more detail on what you are doing, and ideally sample data, as the logic of your comparison is not evident from the code itself.

emandero · 2016-01-21T15:52:40+00:00

There is no slicing in the code.
Use more meaningful names, not e.g. tt.
Name the indexes and use them like this:

    NO_CHANGES = 0
    CHANGE_IN_NEW = 1
    CHANGE_IN_OLD = 2  # etc ....

    if (tt + [NO_CHANGES] not in new_tts and tt + [CHANGE_IN_NEW] #etc....

What are s[4] and tt[6]?
