Commits · 45890f6d34e70d9dd194bd1729eba3ff72cabf78 · openanolis / cloud-kernel

28 Oct, 2015 15 commits

ARC: mm: HIGHMEM: kmap API implementation · 45890f6d

Committed by Vineet Gupta on Mar 09, 2015

Implement kmap* API for ARC.

This enables
 - permanent kernel maps (pkmaps): kmap() API
 - fixmap: kmap_atomic()

We use a very simple/uniform approach for both (unlike some of the other
arches). So fixmap doesn't use the customary compile time address stuff.
The important semantic is sleep'ability (pkmap) vs. not (fixmap) which
the API guarantees.

Note that this patch only enables highmem for subsequent PAE40 support
as there is no real highmem for ARC in pure 32-bit paradigm as explained
below.

ARC has a 2G:2G split of the 32-bit address space, with the lower half
being translated (virtual) while the upper half is untranslated
(0x8000_0000 to 0xFFFF_FFFF). The kernel itself is linked at the base of
the untranslated space (i.e. 0x8000_0000 onwards), which is mapped to say
DDR 0x0 by external Bus Glue logic (outside the core). So the kernel can
potentially access 1.75G worth of memory directly w/o need for highmem.
(the top 256M is taken by uncached peripheral space from 0xF000_0000 to
0xFFFF_FFFF)

In PAE40, hardware can address memory beyond 4G (0x1_0000_0000) while
the logical/virtual addresses remain 32-bits. Thus highmem is required
for kernel proper to be able to access these pages for its own purposes
(user space is agnostic to this anyways).
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

45890f6d
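
The address-space arithmetic in the message above can be checked with a small sketch. The constants come from the commit text; the helper names are hypothetical, and treating every physical address at or above 4G as "needs highmem" is a simplifying assumption for illustration, not the kernel's actual zone logic.

```c
#include <stdint.h>

/* Constants taken from the commit message above. */
#define ARC_UNTRANSLATED_BASE 0x80000000ULL   /* start of the untranslated half */
#define ARC_PERIPHERAL_BASE   0xF0000000ULL   /* uncached peripheral space (top 256M) */
#define ARC_ADDR_SPACE_END    0x100000000ULL  /* 4G, end of the 32-bit space */

/* Hypothetical helper: bytes of memory the kernel can reach directly,
 * i.e. the untranslated half minus the peripheral window. */
static uint64_t arc_direct_map_bytes(void)
{
    return ARC_PERIPHERAL_BASE - ARC_UNTRANSLATED_BASE;
}

/* Hypothetical helper (simplifying assumption): a PAE40 physical address
 * at or above 4G has no permanent kernel mapping, so the kernel would
 * have to reach it through kmap() or kmap_atomic(). */
static int paddr_needs_highmem(uint64_t paddr)
{
    return paddr >= ARC_ADDR_SPACE_END;
}
```

Here 0xF000_0000 - 0x8000_0000 is 0x7000_0000 bytes, which is the 1.75G quoted in the message.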

ARC: mm: preps ahead of HIGHMEM support #2 · 6101be5a

Committed by Vineet Gupta on Oct 28, 2015

Explicit'ify that all memory added so far is low memory
Nothing semantical
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

6101be5a

ARC: mm: preps ahead of HIGHMEM support · 336e2136

Committed by Vineet Gupta on Mar 05, 2015

Before we plug in highmem support, some of the code needs to be ready for it
 - copy_user_highpage() needs to be using the kmap_atomic API
 - mk_pte() can't assume page_address()
 - do_page_fault() can't assume VMALLOC_END is end of kernel vaddr space
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

336e2136

ARC: mm: use generic macros _BITUL()/_AC() · d4084645

Committed by Alexey Brodkin on Sep 02, 2015

Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

d4084645

ARC: mm: Improve Duplicate PD Fault handler · 8840e14c

Committed by Vineet Gupta on Oct 13, 2015

 - Move the verbosity knob from .data to .bss by using inverted logic
 - No need to read out the PD1 descriptor
 - Clip the non-pfn bits of PD0 up front to avoid clipping inside the loop
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

8840e14c
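
The ".data to .bss by using inverted logic" point can be sketched as follows. The variable and function names are hypothetical and are not taken from the actual patch; the section placement described in the comments is the usual toolchain behaviour, not something this listing confirms for ARC.

```c
/* Before (sketch): a default-on flag needs a non-zero initializer, so the
 * variable is typically emitted into .data and stored in the image:
 *
 *     static int dup_pd_verbose = 1;
 */

/* After (sketch): invert the meaning so that the default is zero.
 * Zero-initialized statics are typically placed in .bss. */
static int dup_pd_silent;   /* hypothetical name; 0 means "be verbose" */

/* Callers test the inverted flag instead of the original one. */
static int dup_pd_should_log(void)
{
    return !dup_pd_silent;
}
```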

ARC: Ensure DT mem base is same as what kernel is built with · f759ee57

Committed by Vineet Gupta on Jan 23, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

f759ee57

ARC: boot: Non Master cpus only need to call EARLY_CPU_SETUP once · 483bcc99

Committed by Vineet Gupta on Oct 15, 2015

With the previous fixes, all cores now start via the common entry point @stext,
which already calls EARLY_CPU_SETUP for all cores, so there is no need to
invoke it again
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

483bcc99

ARCv2: smp: [plat-*]: No need to explicitly call mcip_init_smp() · aa0efcde

Committed by Vineet Gupta on Oct 12, 2015

MCIP now registers its own per cpu setup routine (for IPI IRQ request)
using smp_ops.init_irq_cpu().

So no need for platforms to do that. This now completely decouples
platforms from MCIP.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

aa0efcde

ARC: smp: Introduce smp hook @init_irq_cpu called for all cores · 286130eb

Committed by Vineet Gupta on Oct 14, 2015

Note this is not part of platform owned static machine_desc,
but more of device owned plat_smp_ops (rather misnamed) which an IPI
provider or some such typically defines.

This will help us separate out the IPI registration from platform
specific init_cpu_smp() into device specific init_irq_cpu()
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

286130eb

ARC: smp: Rename platform hook @init_smp -> @init_cpu_smp · 8721a7f5

Committed by Vineet Gupta on Oct 13, 2015

This conveys better that it is called for each cpu
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

8721a7f5

ARCv2: smp: [plat-*]: No need to explicitly call mcip_init_early_smp() · 26b8f996

Committed by Vineet Gupta on Oct 12, 2015

MCIP now registers its own probe callback with smp_ops.init_early_smp()
which is called by ARC common code, so no need for platforms to do that.

This decouples the platforms and MCIP and helps confine MCIP details
to its own file.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

26b8f996

ARC: smp: Introduce smp hook @init_early_smp for Master core · e55af4da

Committed by Vineet Gupta on Oct 12, 2015

This adds a platform agnostic early SMP init hook which is called on
Master core before calling setup_processor()

  setup_arch()
     smp_init_cpus()
         smp_ops.init_early_smp()
     ...
     setup_processor()

How this helps:
 - Used for one time init of certain SMP centric IP blocks, before
   calling setup_processor() which probes various bits of core,
   possibly including this block

 - Currently platforms need to call this IP block init from their
   init routines, which doesn't make sense as this is specific to ARC
   core and not platform and otherwise requires copy/paste in all
   (and hence a possible point of failure)

e.g. MCIP init is called from 2 platforms currently (axs10x and sim)
which will go away once we have this.

This change only adds the hooks but they are empty for now. Next commit
will populate them and remove the explicit init calls from platforms.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

e55af4da
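
The call order described in the message above can be sketched with an optional function-pointer hook. This is only an illustration under assumptions: the struct and every function name ending in `_sketch`, plus `ip_block_probe`, are hypothetical stand-ins and not the kernel's real definitions.

```c
#include <stddef.h>

/* Hypothetical, simplified stand-in for the SMP ops structure. */
struct plat_smp_ops_sketch {
    void (*init_early_smp)(void);   /* optional hook: may be left NULL */
};

static struct plat_smp_ops_sketch smp_ops_sketch;

/* Records the order in which the steps run, for illustration only. */
static int call_order[8];
static int n_calls;

static void record(int id)
{
    if (n_calls < 8)
        call_order[n_calls++] = id;
}

/* Hypothetical one-time init of an SMP centric IP block. */
static void ip_block_probe(void)         { record(1); }

/* Stand-in for the core probing done by setup_processor(). */
static void setup_processor_sketch(void) { record(2); }

static void smp_init_cpus_sketch(void)
{
    /* An empty hook is simply skipped. */
    if (smp_ops_sketch.init_early_smp)
        smp_ops_sketch.init_early_smp();
}

/* Mirrors the order in the message: early SMP hook, then processor setup. */
static void setup_arch_sketch(void)
{
    smp_init_cpus_sketch();
    setup_processor_sketch();
}
```

An IP block driver would assign its probe routine to the hook, so platforms no longer need to call it from their own init code.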

ARC: remove @init_time, @init_irq platform callbacks · 4c82f286

Committed by Vineet Gupta on Oct 13, 2015

These are not in use for ARC platforms. Moreover DT mechanisms exist to
probe them w/o explicit platform calls.

 - clocksource drivers can use CLOCKSOURCE_OF_DECLARE()
 - intc IRQCHIP_DECLARE() calls + cascading inside DT allows external
   intc to be probed automatically
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

4c82f286

ARC: smp: irqchip: handle IPI as percpu irq like timer · e0868e6f

Committed by Vineet Gupta on Oct 12, 2015

The reason this was not done so far was the lack of a genuine IPI_IRQ for
ARC700, as we don't have an SMP version of the core yet (which might change
soon thanks to EZChip). Nevertheless, to increase the build coverage, we
need to allow CONFIG_SMP for ARC700 and still be able to run it on a
UP platform (nsim or AXS101) with a UP Device Tree (SMP-on-UP)

The build itself requires some define for IPI_IRQ and even a dummy
value is fine since that code won't run anyways.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

e0868e6f

ARC: boot: Support Halt-on-reset and Run-on-reset SMP booting modes · 3971cdc2

Committed by Vineet Gupta on Oct 09, 2015

For Run-on-reset, non masters need to spin wait. For Halt-on-reset they
can jump to entry point directly.

Also while at it, made the reset vector handler "the" entry point for
the kernel, including host debugger based boot (which uses the ELF header
entry point)
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

3971cdc2

17 Oct, 2015 14 commits

ARC: smp: Move default boot kick/wait code out of MCIP into common code · f33e9c43

Committed by Vineet Gupta on Oct 09, 2015

For the non halt-on-reset case, all cores start off simultaneously in @stext.
Master core0 proceeds with kernel boot, while the others spin-wait on
@wake_flag being set by master once it is ready. So NO hardware assist
is needed for master to "kick" the others.

This patch moves this soft implementation out of mcip.c (as there is no
hardware assist) into common smp.c
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

f33e9c43

ARC: boot log: decode more mmu config items · d0890ea5

Committed by Vineet Gupta on Oct 02, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

d0890ea5

ARC: boot log: move helper macros to header for reuse · 964cf28f

Committed by Vineet Gupta on Oct 02, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

964cf28f

ARC: mm: compute TLB size as needed from ways * sets · b598e17f

Committed by Vineet Gupta on Oct 02, 2015

This frees up some bits to hold more high level info such as PAE being
present, w/o increasing the size of the already bloated cpuinfo struct
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

b598e17f

ARC: mm: MMU v1..v3 only selectable for ARCompact ISA based cores · c583ee4f

Committed by Vineet Gupta on Sep 29, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

c583ee4f

ARC: make write_aux_reg safer against macro substitution · 5c35ee64

Committed by Vineet Gupta on Sep 29, 2015

It was generating warnings when called as write_aux_reg(x, paddr >> 32)
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

5c35ee64
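
The class of problem named in the message above can be illustrated with a generic macro hygiene sketch. This is an assumption about the kind of issue involved, not a copy of the actual write_aux_reg() definition; LOW32_SAFE and upper_bits are hypothetical names.

```c
#include <stdint.h>

/* Unsafe form: a cast binds tighter than '>>', so an argument such as
 * 'paddr >> 32' becomes '((uint32_t)paddr) >> 32' after substitution.
 * It is left commented out because shifting a 32-bit value by 32 draws a
 * compiler warning and is undefined behaviour:
 *
 *     #define LOW32_UNSAFE(v)  ((uint32_t)v)
 */

/* Safe form: parenthesize the macro argument so the whole expression is
 * evaluated first and only the result is cast. */
#define LOW32_SAFE(v)  ((uint32_t)(v))

/* Extract the bits above bit 31 of a wide (e.g. 40-bit) physical address. */
static uint32_t upper_bits(uint64_t paddr)
{
    return LOW32_SAFE(paddr >> 32);
}
```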

ARC: [arcompact] entry.S: Elide extra check/branch in exception ret path · 9fabcc63

Committed by Vineet Gupta on Oct 08, 2015

This is done by improving the laddering logic!

Before:

   if Exception
      goto excep_or_pure_k_ret

   if !Interrupt(L2)
      goto l1_chk
   else
      INTERRUPT_EPILOGUE 2

 l1_chk:
   if !Interrupt(L1)  (i.e. pure kernel mode)
      goto excep_or_pure_k_ret
   else
      INTERRUPT_EPILOGUE 1

 excep_or_pure_k_ret:
   EXCEPTION_EPILOGUE

Now:

   if !Interrupt(L1 or L2) (i.e. exception or pure kernel mode)
      goto excep_or_pure_k_ret

  ; guaranteed to be an interrupt
   if !Interrupt(L2)
      goto l1_ret
   else
      INTERRUPT_EPILOGUE 2

 ; by virtue of above, no need to chk for L1 active
 l1_ret:
    INTERRUPT_EPILOGUE 1

 excep_or_pure_k_ret:
    EXCEPTION_EPILOGUE
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

9fabcc63
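
The two ladders above can be modelled in C to check that they pick the same epilogue. This is a sketch, assuming (as the message implies) that "exception" and "interrupt active" are mutually exclusive on this return path; the enum and function names are hypothetical.

```c
enum epilogue {
    EXCEPTION_EPILOGUE,
    INTERRUPT_EPILOGUE_1,
    INTERRUPT_EPILOGUE_2
};

/* "Before" ladder: three checks in the worst case. */
static enum epilogue ret_path_before(int excep, int l1, int l2)
{
    if (excep)
        return EXCEPTION_EPILOGUE;
    if (l2)
        return INTERRUPT_EPILOGUE_2;
    if (!l1)                          /* pure kernel mode */
        return EXCEPTION_EPILOGUE;
    return INTERRUPT_EPILOGUE_1;
}

/* "Now" ladder: one combined check, then a single L2 test. */
static enum epilogue ret_path_now(int l1, int l2)
{
    if (!(l1 || l2))                  /* exception or pure kernel mode */
        return EXCEPTION_EPILOGUE;
    if (l2)
        return INTERRUPT_EPILOGUE_2;
    return INTERRUPT_EPILOGUE_1;      /* guaranteed to be L1 */
}

/* Returns 1 if both ladders agree on every state allowed by the
 * exclusivity assumption stated above. */
static int ladders_agree(void)
{
    int l1, l2;

    for (l1 = 0; l1 <= 1; l1++)
        for (l2 = 0; l2 <= 1; l2++)
            if (ret_path_before(0, l1, l2) != ret_path_now(l1, l2))
                return 0;

    /* Exception with no interrupt level active. */
    return ret_path_before(1, 0, 0) == ret_path_now(0, 0);
}
```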

ARC: [arcompact] entry.S: Document preemption games for L2 intr · 5f888087

Committed by Vineet Gupta on Sep 06, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

5f888087

ARC: [arcompact] entry.S: Improve early return from exception · 55a2ae77

Committed by Vineet Gupta on Sep 05, 2015

The requirement is to
 - Reenable Exceptions (AE cleared)
 - Reenable Interrupts (E1/E2 set)

We need to wiggle these bits into ERSTATUS and call RTIE.

Prev version used the pre-exception STATUS32 as starting point for what
goes into ERSTATUS. This required explicit fixups of U/DE/L bits.

Instead, use the current (in-exception) STATUS32 as starting point.
Being in exception handler U/DE/L can be safely assumed to be correct.
Only AE/E1/E2 need to be fixed.

So the new implementation is slightly better
 - Avoids a read from memory
 - Is 4 bytes smaller for the typical 1 level of intr configuration
 - Depicts the semantics more clearly
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

55a2ae77

ARC: [arcompact] don't check for hard isr calling local_irq_enable() · 9dbd3d9b

Committed by Vineet Gupta on Sep 05, 2015

Historically this was done by ARC IDE driver, which is long gone.
IRQ core is pretty robust now and already checks if IRQs are enabled
in hard ISRs. Thus no point in checking this in arch code, for every
call to local_irq_enable().

Further if some driver does do that - let it bring down the system so we
notice/fix this sooner than covering up for sucker

This makes local_irq_enable() - for the L1 only case at least - simple enough
so we can inline it.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

9dbd3d9b

ARCv2: mm: THP: flush_pmd_tlb_range make SMP safe · c7119d56

Committed by Vineet Gupta on Oct 15, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

c7119d56

ARCv2: mm: THP: Implement flush_pmd_tlb_range() optimization · 722fe8fd

Committed by Vineet Gupta on Feb 27, 2015

Implement the TLB flush routine to evict a specific Super TLB entry,
vs. moving to a new ASID on every such flush.
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

722fe8fd

ARCv2: mm: THP: boot validation/reporting · 6ce18798

Committed by Vineet Gupta on Mar 12, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

6ce18798

ARCv2: mm: THP support · fe6c1b86

Committed by Vineet Gupta on Jul 08, 2014

MMUv4 in HS38x cores supports Super Pages, which are the basis for Linux THP
support.

Normal and Super pages can co-exist (of course not overlap) in the TLB, with a
new bit "SZ" in the TLB page descriptor to distinguish between them.
Super Page size is configurable in hardware (4K to 16M), but fixed once
RTL builds.

The exact THP size a Linux configuration will support is a function of:
 - MMU page size (typical 8K, RTL fixed)
 - software page walker address split between PGD:PTE:PFN (typical
   11:8:13, but can be changed with 1 line)

So for above default, THP size supported is 8K * 256 = 2M

Default Page Walker is 2 levels, PGD:PTE:PFN, which in THP regime
reduces to 1 level (as PTE is folded into PGD and canonically referred
to as PMD).

Thus thp PMD accessors are implemented in terms of PTE (just like sparc)
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

fe6c1b86
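
The "8K * 256 = 2M" figure in the message above follows directly from the 11:8:13 split. A minimal sketch of that arithmetic, with hypothetical macro and function names (the third field, which the message labels PFN, is treated here as the in-page offset width):

```c
/* Typical split from the commit message: PGD:PTE:PFN = 11:8:13. */
#define SKETCH_PGD_BITS     11
#define SKETCH_PTE_BITS     8
#define SKETCH_OFFSET_BITS  13   /* 2^13 = 8K base page */

/* Base MMU page size in bytes. */
static unsigned long sketch_page_size(void)
{
    return 1UL << SKETCH_OFFSET_BITS;
}

/* Number of PTE entries covered by one PGD entry. */
static unsigned long sketch_ptes_per_pgd_entry(void)
{
    return 1UL << SKETCH_PTE_BITS;
}

/* With the PTE level folded away, one PMD-level entry maps this much. */
static unsigned long sketch_thp_size(void)
{
    return sketch_page_size() * sketch_ptes_per_pgd_entry();
}
```

Changing the split (the "1 line" the message mentions) changes the THP size accordingly, as long as the three widths still add up to 32.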

09 Oct, 2015 3 commits

ARC: mm: Introduce PTE_SPECIAL · 24830fc7

Committed by Vineet Gupta on Feb 16, 2015

Needed for THP, but will also come in handy for fast GUP later
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

24830fc7

ARC: mm: pte flags cosmetic cleanups, comments · 129cbed5

Committed by Vineet Gupta on Dec 05, 2013

No semantical changes
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

129cbed5

ARC: mm: switch pgtable_t to pte_t * · e8a75963

Committed by Vineet Gupta on Aug 28, 2015

ARC is the only arch with unsigned long type (vs. struct page *).
Historically this was done to avoid the page_address() calls in various
arch hooks which need to get the virtual/logical address of the table.

Some arches alternatively define it as pte_t *, which is as efficient as
unsigned long (generated code doesn't change)
Suggested-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

e8a75963

16 Sep, 2015 1 commit

genirq: Remove irq argument from irq flow handlers · bd0b9ac4

Committed by Thomas Gleixner on Sep 14, 2015

Most interrupt flow handlers do not use the irq argument. Those few
which use it can retrieve the irq number from the irq descriptor.

Remove the argument.

Search and replace was done with coccinelle and some extra helper
scripts around it. Thanks to Julia for her help!
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Cc: Julia Lawall <Julia.Lawall@lip6.fr>
Cc: Jiang Liu <jiang.liu@linux.intel.com>

bd0b9ac4

12 Sep, 2015 1 commit

ARCv2: [axs103_smp] Reduce clk for SMP FPGA configs · 3ebb0540

Committed by Vineet Gupta on Sep 11, 2015

Newer bitfiles need the reduced clk even for SMP builds

Cc: <stable@vger.kernel.org>  #4.2
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

3ebb0540

27 Aug, 2015 6 commits

ARCv2: entry: Fix reserved handler · 3d592659

Committed by Vineet Gupta on Aug 27, 2015

Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

3d592659

ARCv2: perf: Finally introduce HS perf unit · 9b28829d

Committed by Vineet Gupta on Nov 18, 2014

With all features in place, the ARC HS pct block can now be effectively
allowed to be probed/used
Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

9b28829d

ARCv2: perf: SMP support · e525c37f

Committed by Alexey Brodkin on Aug 24, 2015

* split off pmu info into singleton and per-cpu bits
* setup PMU on all cores
Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

e525c37f

ARCv2: perf: implement exclusion of event counting in user or kernel mode · e6b1d126

Committed by Alexey Brodkin on Aug 24, 2015

Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

e6b1d126

ARCv2: perf: Support sampling events using overflow interrupts · 36481cf7

Committed by Alexey Brodkin on Aug 24, 2015

In the days of ARC 700, performance counters had no interrupt support,
so for ARC we only supported non-sampling events.

Put simply, only "perf stat" was functional.

Now with ARC HS the performance counters do support interrupts, and this
change adds support for them.

ARC performance counters behave as follows with regard to interrupt
generation:
 [1] A counter counts starting from the value set in the PCT_COUNT register pair
 [2] Once the counter reaches the value set in PCT_INT_CNT, an interrupt is raised

Basic setup looks like this:
 [1] PCT_COUNT = 0;
 [2] PCT_INT_CNT = __limit_value__;
 [3] Enable interrupts for that counter and let it run
 [4] Let counter reach its limit
 [5] Handle interrupt when it happens

Note that the PCT HW block is built into the CPU core, so its interrupt
line (which is basically the OR of all counters' IRQs) is wired directly to
the top-level IRQC. That means that to de-assert the PCT interrupt, the IRQs
of all counters that have reached their limit values must be reset.
Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

36481cf7

ARCv2: perf: implement "event_set_period" · 1fe8bfa5

Committed by Alexey Brodkin on Aug 24, 2015

This generalization prepares for support of overflow interrupts.

Hardware event counters on ARC work this way:
Each counter counts from a programmed start value (set in
ARC_REG_PCT_COUNT) to a limit value (set in ARC_REG_PCT_INT_CNT) and
once the limit value is reached the counter generates an interrupt.

Even though this hardware implementation allows for more flexibility,
in the Linux kernel we decided to mimic the behavior of other architectures
this way:

 [1] Set limit value as half of counter's max value (to allow counter to
     run after reaching its limit, see below for more explanation):
 ---------->8-----------
 arc_pmu->max_period = (1ULL << counter_size) / 2 - 1ULL;
 ---------->8-----------

 [2] Set start value as "arc_pmu->max_period - sample_period" and then
     count up to the limit

Our event counters don't stop on reaching the limit value (the one we set in
ARC_REG_PCT_INT_CNT) but continue to count until the kernel explicitly
stops each of them.

Setting the limit to half of the counter capacity allows capturing the
additional events that occur between the moment the interrupt is
triggered and the moment we actually process PMU interrupts. That way
we're trying to be more precise.

For example if we count CPU cycles we keep track of cycles while
running through generic IRQ handling code:

 [1] We set counter period as say 100_000 events of type "crun"
 [2] Counter reaches that limit and raises its interrupt
 [3] Once we get into the PMU IRQ handler we read the current counter value
     from ARC_REG_PCT_SNAP and see there something like 105_000.

If counters stopped on reaching a limit value, we would miss the
additional 5000 cycles.
Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Arnaldo Carvalho de Melo <acme@kernel.org>
Signed-off-by: Alexey Brodkin <abrodkin@synopsys.com>
Signed-off-by: Vineet Gupta <vgupta@synopsys.com>

1fe8bfa5
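
The period arithmetic described above can be sketched as plain functions. Only the max_period formula is quoted from the message; the function names are hypothetical, and the sketch assumes counter_size is below 64 and sample_period does not exceed max_period.

```c
#include <stdint.h>

/* Limit value: half of the counter's range, as quoted in the message.
 * Assumes counter_size < 64. */
static uint64_t sketch_max_period(unsigned int counter_size)
{
    return (1ULL << counter_size) / 2 - 1ULL;
}

/* Start value, so that the limit is hit after sample_period events.
 * Assumes sample_period <= max_period. */
static uint64_t sketch_start_value(unsigned int counter_size,
                                   uint64_t sample_period)
{
    return sketch_max_period(counter_size) - sample_period;
}

/* Events counted since the start value, given a snapshot that may lie
 * past the limit because the counter keeps running after the interrupt. */
static uint64_t sketch_events_since_start(uint64_t snapshot, uint64_t start)
{
    return snapshot - start;
}
```

With a period of 100_000 and a snapshot taken 5000 events past the limit, the last helper reports 105_000, matching the example in the message.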

openanolis / cloud-kernel: synced successfully more than 1 year ago
