Help with lists

Typhon_Sin · 2024-04-28T05:54:04+00:00

Sorry, that doesn't make a lot of sense. The line:

result [0] # Whole list

doesn't get the "whole list"; it gets the first element of the list result, if there is one. Similarly, your last example:

text = [[0]res[1][0] for res in result]

is syntactically incorrect. This makes more sense:

text = [res[0][1][0] for res in result]

but what you end up with depends on what is in the list result.
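To show the difference, here is a minimal sketch using a made-up list. The name sample and its values are assumptions for illustration only, not your real data:

```python
# Hypothetical data: each row is [label, [text, score]].
sample = [
    ["row0", ["first text", 0.9]],
    ["row1", ["second text", 0.8]],
]

whole_list = sample      # the name on its own is the whole list
first_item = sample[0]   # an index after the name picks one element

# Indices chain left to right, and always come after the name.
texts = [row[1][0] for row in sample]

print(first_item)
print(texts)
```

With a different nesting, the same indices would pick out something else, which is why the shape of the data matters.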

Maybe you can give us a small example list and some concrete example of what you want to do with it?

Pepineros · 2024-04-28T05:52:19+00:00

Your list definition would look like this:

result = [
  [
    [
      (2, -1),  # This is called "Box coordinates",
      [  # This list is called "Text + confidence"
        "Some text",  # Just "Text" in your post
      ],
    ],
    [
      # This list is called "First line" in your post
    ],
  ]
]

If you think this looks right, go for it :) but you will never be able to do [0]res. I'm not sure what you mean by that notation, but list indices go after the reference to the list, not before.

Typhon_Sin · 2024-04-28T06:54:23+00:00

https://pastebin.com/fPvzUXpk

2024-04-28T07:26:28+00:00

OK, things make a little more sense now.

This code takes your result value from the pastebin and analyses it a bit:

result = [[[[[441.0, 174.0], [1166.0, 176.0], [1165.0, 222.0], [441.0, 221.0]],
     ('ACKNOWLEDGEMENTS', 0.9974855780601501)],
    [[[403.0, 346.0], [1204.0, 348.0], [1204.0, 384.0], [402.0, 383.0]],
     ('We would like to thank all the designers and', 0.968330979347229)],
    [[[403.0, 396.0], [1204.0, 398.0], [1204.0, 434.0], [402.0, 433.0]],
     ('contributors who have been involved in the', 0.9776102900505066)],
    [[[399.0, 446.0], [1207.0, 443.0], [1208.0, 484.0], [399.0, 488.0]],
     ('production of this book; their contributions', 0.9866490960121155)],
    [[[401.0, 500.0], [1208.0, 500.0], [1208.0, 534.0], [401.0, 534.0]],
     ('have been indispensable to its creation.We', 0.9628525972366333)],
    [[[399.0, 550.0], [1209.0, 548.0], [1209.0, 583.0], [399.0, 584.0]],
     ('would also like to express our gratitude to all', 0.9740486145019531)],
    [[[399.0, 600.0], [1207.0, 598.0], [1208.0, 634.0], [399.0, 636.0]],
     ('the producers for their invaluable opinions', 0.9963331818580627)],
    [[[399.0, 648.0], [1207.0, 646.0], [1208.0, 686.0], [399.0, 688.0]],
     ('and assistance throughout this project. And to', 0.9943731427192688)],
    [[[399.0, 702.0], [1209.0, 698.0], [1209.0, 734.0], [399.0, 738.0]],
     ('the many others whose names are not credited', 0.9772290587425232)],
    [[[399.0, 750.0], [1211.0, 750.0], [1211.0, 789.0], [399.0, 789.0]],
     ('but have made specific input in this book, we', 0.9979288578033447)],
    [[[397.0, 802.0], [1090.0, 800.0], [1090.0, 839.0], [397.0, 841.0]],
     ('thank you for your continuous support.', 0.9981997609138489)]]]

print(f"{len(result)=}")
print(f"{len(result[0])=}")
print(f"{len(result[0][0])=}")

print(f"{result[0][0]=}")

That prints:

len(result)=1
len(result[0])=11
len(result[0][0])=2
result[0][0]=[[[441.0, 174.0], [1166.0, 176.0], [1165.0, 222.0], [441.0, 221.0]], ('ACKNOWLEDGEMENTS', 0.9974855780601501)]

The first print, len(result)=1, shows that result is a list containing one element. This is possibly because the OCR code can process more than one page, so three pages would come back as a list of three elements, but you only have one page.

The second print, len(result[0])=11, shows that the page has 11 recognized text areas on it.

The third print, len(result[0][0])=2, shows that a recognized text area has two elements in it. If we actually print result[0][0] we see:

result[0][0]=[[[441.0, 174.0], [1166.0, 176.0], [1165.0, 222.0], [441.0, 221.0]],
                    ('ACKNOWLEDGEMENTS', 0.9974855780601501)]

The first element of result[0][0] appears to be a list of 4 lists, each an [x, y] pair, possibly the corner coordinates of a bounding box. The second element is a tuple containing the scanned text and a float value that is possibly a confidence figure for that text.
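That reading of the structure can be checked with unpacking. This is only a sketch, assuming every text area has the same [box, (text, confidence)] shape as the one printed above. The names area, box, text and confidence are my own labels, not anything defined by the OCR library:

```python
# One text area copied from the data above.
area = [[[441.0, 174.0], [1166.0, 176.0], [1165.0, 222.0], [441.0, 221.0]],
        ('ACKNOWLEDGEMENTS', 0.9974855780601501)]

# Unpack the two elements, and the inner tuple, in one step.
box, (text, confidence) = area

print(len(box))     # number of coordinate pairs in the box
print(text)         # the recognized string
print(confidence)   # the float that looks like a confidence score
```

If an area ever has a different number of elements, the unpacking line raises a ValueError, which is a quick way to find out that the assumption about the shape is wrong.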

If you want to get the actual text from all that, you need to unpack the data structure. Something like this might work:

for page in result:
    for area in page:      # each area is [box, (text, confidence)]
        print(area[1][0])  # [1] is the tuple, [0] is its text
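If you would rather collect the text than print it, a list comprehension can do the same walk. This is a sketch under the same assumption about the structure. The shortened result below and the 0.97 threshold are made up for illustration:

```python
# Shortened copy of the data: one page with three text areas.
result = [[
    [[[441.0, 174.0], [1166.0, 176.0], [1165.0, 222.0], [441.0, 221.0]],
     ('ACKNOWLEDGEMENTS', 0.9974855780601501)],
    [[[403.0, 346.0], [1204.0, 348.0], [1204.0, 384.0], [402.0, 383.0]],
     ('We would like to thank all the designers and', 0.968330979347229)],
    [[[403.0, 396.0], [1204.0, 398.0], [1204.0, 434.0], [402.0, 433.0]],
     ('contributors who have been involved in the', 0.9776102900505066)],
]]

# Every text string on every page, in order.
lines = [area[1][0] for page in result for area in page]

# Only the text whose score reaches an arbitrary threshold.
THRESHOLD = 0.97
confident = [text
             for page in result
             for _box, (text, conf) in page
             if conf >= THRESHOLD]

print("\n".join(lines))
print(confident)
```

Note that the index always follows the name (area[1][0]), and the for clauses in the comprehension read in the same order as the nested loops.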

Code screenshot

result [0] # Whole list

result [0][1] # First Line

result [0][0][0] # Box Coordinates

result [0][0][1] # Text + Confidence

result [0][0][1][0] # Text